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Abstract 

We review some theoretical and phenomenological aspects of massive gravities in 4 
dimensions. We start from the Fierz-Pauli theory with Lorentz-invariant mass terms 
and then proceed to Lorentz- violating masses. Unlike the former theory, some models 
with Lorentz-violation have no pathologies in the spectrum in flat and nearly flat 
backgrounds and lead to interesting phenomenology. 
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1 Introduction 



Recently, there has been revival of interest in attempts to construct models of gravity which 
deviate from General Relativity at ultra-large distance and time scales, that is, models 
with infrared-modified gravity. The general approach is to view these models as possible 
low energy limits of an unknown fundamental theory, and at exploratory stage not worry 
too much about issues like renormalizability, embedding into an ultraviolet-complete theory, 
etc. Yet there are other self-consistency problems that limit severely the classes of acceptable 
models, while phenomenology of remaining ones turns out to be rather reach and interesting. 

Besides pure curiosity, there were several original motivations for the recent increase 
of this activity. One of them has to do with the cosmological constant problem and the 
observational evidence for the accelerated expansion of the Universe at the present epoch 
(for reviews from theoretical prospective see, e.g., Refs. [1, 2, 3, 4, 5, 6, 7]). This accelerated 
expansion may well be due to the cosmological constant = vacuum energy density, new 
weakly interacting field or some other kind of dark energy, which contributes, according to 
Ref. [8], about 75% to the total energy density p c in the present Universe. The problem is 
that the value e A of the dark energy density is very small, 

e A ~ 0.75p c ~ 4 • 1(T 6 -^ T . 

cm 6 

This is by many orders of magnitude smaller than the values one would associate, on dimen- 
sional grounds, with fundamental interactions — strong, electroweak and gravitational, 

e A ~ lO- m t QCD ~ 10- 54 e w ~ 10- 123 e 3ra „. 

In other words, the energy scale characteristic of dark energy 1 , M A = e 1 / 4 ~ 10~ 3 eV, is much 
smaller than the energy scales of known interactions, Aqcd — 200 MeV, Mw — 80 GeV, 
Mpi ~ 10 19 GeV. The unnatural smallness of e A (or Ma) is precisely the cosmological 
constant problem. 

In fact, there are two parts of this problem. One is that the contributions from strong 
(QCD), electroweak and gravitational sectors to vacuum energy density should be of order 
of £qcd — Aqcdi € ew ~ M^r and e grav ~ Mp ; , respectively 2 . Thus, the first part of the 
cosmological constant problem is to explain why e A is essentially zero. The second part is to 
understand why e A is in fact non-zero, and what physics is behind the energy scale M A . 

1 Hereafter we set K = c = 1. 

2 As an example, QCD vacuum has complex structure — there are quark and gluon condensates whose 
values are determined by highly complicated (and largely unknown) dynamics and depend on QCD param- 
eters (Aqcd and quark masses) in a complicated way. The difference between the energy densities of this 
vacuum and the naive, perturbative one is certainly of order €qcd, so there is no reason whatsoever for the 
energy density of the physical vacuum to be 46 orders of magnitude smaller than eqcd- 
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It is not inconceivable that the first part of the dark energy problem may be solved 
by one or another mechanism that drives the cosmological constant to zero (for a review 
see, e.g., Ref. [9]); such a mechanism most probably would operate at a cosmological epoch 
which preceeded any known stage of the cosmological evolution, but at which the state of 
the Universe was quite similar to the present one 3 [10, 11]. 

On the other hand, despite numerous attempts, no compelling idea has been put forward 
of how the value of M\ may be related to other known fundamental energy scales. One 
possible viewpoint is that eA is actually the cosmological constant (time-independent quan- 
tity during the known history of our part of the Universe), and that its value is determined 
anthropically (for reviews see, e.g., Refs. [1, 12]): much larger values of |ca| would be incon- 
sistent with our existence. This viewpoint implies that the Universe is much larger than its 
visible part, and that e\ takes different values in different cosmo logically large regions; we 
happen to have measured small value of e\ merely because there is nobody in other places 
to measure (larger values of) the cosmological "constant". 

Another option is that the accelerated expansion of the Universe is due to new low energy 
(infrared) physics. Perhaps the best known examples are quintessence models (for reviews 
see, e.g., Refs. [2, 3, 4, 5, 6]), in which gravity is described by General Relativity while 
the accelerated expansion is driven by (dark) energy of a new super-weakly interacting field 
(most conventionally, but not necessarily, this field is Lorentz scalar). The original idea of 
infrared- modified gravity is that, instead, the gravitational laws get changed at cosmological 
distance and time scales, hopefully leading to the accelerated expansion without dark energy 
at all. This would certainly be an interesting alternative to dark energy, which might even 
be observationally testable. 

Another original motivation for infrared-modified gravities came from theories with brane- 
worlds and extra dimensions of large or infinite size (for a review see, e.g., Ref. [13]). In these 
theories, ordinary matter is trapped to a three-dimensional hypersurface (brane) embedded 
in higher- dimensional space. The idea [14, 15, 16] is that gravitons may propagate along 
"our" brane for finite (albeit long) time, after which they escape into extra dimensions. This 
would modify the brane-to-brane graviton propagator at large distances and time intervals, 
thus changing the gravitational interactions between particles on "our" brane. If successful, 
models with this property would provide concrete and calculable examples of the infrared 
modification of gravity. Again, this idea is very hard to implement in a self-consistent way, 
and the models constructed so far have their intrinsic problems. In this regard, it is worth 
mentioning that there are claims for an exception: it has been argued [17] that the Dvali- 
Gabadadze-Porrati "brane-induced gravity" model [18] (for a review see Ref. [19]) may be 
fully self-consistent despite the fact that on the face of it, this model becomes strongly 
coupled at unacceptably large distances [20, 21, 22]. Interestingly, the DGP model has 

3 A name suggested by G. Ross for this scenario is "deja vu Universe". 
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a self-accelerating branch of cosmological solutions [23, 24, 25], which, however, has phe- 
nomenologically unacceptable ghosts among perturbations about these solutions [20, 26]. 

Among other lines of thought we mention theories attempting to incorporate MOND [27, 
28, 29, 30, 31], which modify gravity for explaining rotation curves of galaxies without dark 
matter, and RTG [32, 33] motivated by the desire to restore the full generality of energy and 
momentum conservation laws. It remains to be seen whether these theories can be made 
fully self-consistent and phenomenologically acceptable. 

Recently, massive graviton has been motivated from quite a different prospective [34]. 
Namely, there is a fairly widespread expectation that Quantum Chromodynamics (QCD) 
may have a formulation in terms of a string theory of some sort. Known string theories, 
however, often have massless spin-2 state in the spectrum, while QCD does not. The argu- 
ment is that it is desirable to remove this state from the massless sector of string theory by 
giving it a mass. In terms of effective four-dimensional low energy theory, this task appears 
very similar to giving a mass to graviton. 

One naturally expects that the infrared modification of gravity may be associated with the 
modification of the dispersion law uo = uj(p) of metric perturbations at low spatial momenta 
p, the simplest option being the graviton mass. In this review we will mostly discuss this 
type of theories, and stay in 4 dimensions. We stress, however, that theories of this type 
by no means exhaust all possible classes of theories with infrared-modified gravity. Other 
classes include, e.g., scalar-tensor theories, in which the modification of gravity occurs due 
to the presence of extra field(s) (scalars), over and beyond the space-time metric, that are 
relevant in the infrared domain. There are examples of models belonging to the latter class 
that not only are phenomenologically acceptable but also lead to interesting cosmological 
dynamics, including the accelerated expansion of the Universe [35, 36, 37]. Another class 
of models involves condensates of vector and/or tensor fields, see, e.g., Refs. [38, 39] and 
references therein. The discussion of these and similar models is beyond the scope of this 
review. 

As it often happens, irrespectively of the original motivations, theoretical developments 
have lead to new insights. In the case of infrared-modified gravity and modified graviton 
dispersion law, these are insights into self-consistency issues, on the one hand, and phe- 
nomenological implications, on the other. The reason behind self-consistency problems is 
the lack of explicit invariance under general coordinate transformations (or non-trivial real- 
ization of these transformations). Indeed, unless extra fields are added to the gravitational 
sector of the theory, straightforward implementation of the requirement of this gauge sym- 
metry leads in a unique way to General Relativity (with cosmological constant) plus possible 
higher-order terms irrelevant in the infrared domain. Once this gauge symmetry is broken, 
explicitly or spontaneously, gravity gets infrared-modified, but new light degrees of freedom 
may appear among metric perturbations, over and beyond spin-2 gravitons. These new 



5 



degrees of freedom may be ghosts or tachyons, which is often unacceptable. Another dan- 
gerous possibility is that they may be strongly interacting at energy scales above a certain 
"ultraviolet" scale A UV - This would mean that the theory of gravity gets out of control at 
energies above A uv . If A uv is too low, and if the new degrees of freedom do not effectively 
decouple, the theory cannot be claimed phenomenologically acceptable. We will see that 
the problems of this sort are quite generic to theories admitting Minkowski background and 
having explicit Lorentz-invariance in this background. 

As far as four-dimensional models with infrared-modified gravity are concerned, avoiding 
the self-consistency problems is relatively easy if Lorentz-invariance is broken for excitations 
about flat background. The main emphasis of this review is on models of this type [40, 41, 
42, 43, 44]. Breaking of Lorentz-invariance is in fact quite natural in this context. Indeed, 
infrared modification of gravity may be thought of as an analog of the broken (Higgs) phase 
in gauge theories, gravity in a certain sense being the gauge theory of the Lorentz group. 
Gravity in the Higgs phase is thus naturally expected to be Lorentz- violating. We will discuss 
various aspects of theories of this type, including self-consistency, technical naturalness and 
phenomenology. The latter is quite interesting in a number of cases, as intuition gained in 
Lorentz-invariant field theories is often misleading when Lorentz-invariance does not hold. 
In the end of this review we come back to the issue of the accelerated expansion of the 
Universe. 

2 Fierz-Pauli model 

To understand better the problems arising when one attempts to modify the gravitational 
interaction at large distances, it is instructive to consider first the Lorentz-invariant massive 
gravity. The Lorentz-invariant graviton mass term was proposed by Fierz and Pauli [45] ; we 
will refer to the corresponding model as the Fierz-Pauli model. 

One may think of a theory, belonging to the class we discuss in this Section, in various 
ways. One may simply add graviton mass terms to the Einstein-Hilbert action, as we do 
in what follows. Equivalently, one may consider General Relativity interacting with extra 
massless fields (see, e.g., Refs. [46, 47] and references therein). Once these fields obtain 
background values which depend on space-time coordinates, general covariance is broken, and 
graviton gets a mass in a manner reminiscent of the Higgs mechanism. In either approach, 
one arrives at one and the same class of theories, provided that one imposes the following 
requirements: (i) Minkowski space-time is a legitimate background, i.e., flat metric solves 
the field equations 4 ; (ii) in Minkowski background, there are no light fields except for metric 

4 It is worth noting that massive gravities with legitimate backgrounds other than Minkowski may have 
properties quite different from those exposed in this Section. A well studied example is massive gravity 
about (anti-) de Sitter space [48]. 
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perturbations; (iii) Lorentz invariance is unbroken in this background. 

The issues which generically arise in modified gravities are: the ghosts/instabilities; 
the absence of the zero mass limit (the van Dam-Veltman-Zakharov discontinuity); strong 
coupling at parametrically low ultraviolet (UV) energy scale; the existence of a "hidden" 
Boulware-Deser mode which is not seen in the analysis of perturbations about Minkowski 
background but becomes propagating — and dangerous — once the background is curved, 
even slightly. In this Section we discuss these issues using the Fierz-Pauli model as an 
example. 



2.1 Lorentz-invariant massive gravity in Minkowski background 

Consider General Relativity plus the most general Lorentz-invariant graviton mass term 
added to the action. We parameterize nearly Minkowski metric as follows, 

We will sometimes need the expressions for g^ v and \f^g up to the second order, 

= i + ^ + ^a-^V, (i) 

where indices are raised and lowered with Minkowski metric i]^ u = diag(l, —1,-1,-1). At 
the quadratic level about Minkowski background, the general action for Lorentz-invariant 
massive gravity reads 

s = *4 l J d'x |4 2 i(V) + \ + - A H) 2 } > ( 2 ) 

where Mp t = l/(167rG), a and (5 are arbitrary coefficients of dimension [mass squared], while 
L E ' H is the standard graviton kinetic term 
latter can be written in the following form, 



(2) 

L EH is the standard graviton kinetic term coming from the Einstein-Hilbert action. The 



L ( eh = \ (dxhT d% u - 2d,h^ d x ht + IdJT d v h\ - d,K d»h x x ) . (3) 

When discussing metric perturbations, we will use the convention that the Lagrangian and 
the action are related by 

S = M 2 Pl [ d 4 x L . (4) 



This will simplify formulas; many of them will not include Mpi. 
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In what follows it will be convenient to use both the above Lorentz-covariant form of the 
Lagrangian and the form corresponding to the (3+1) decomposition. The metric perturba- 
tions in the latter formalism are traditionally parameterized as follows [49], 

h 00 = 2(p, 

hoi = Si + diB, 

hij = h?? -d i Fj-d j F i -2{il>5 i j-d i d j E). (5) 
Here hJ T is a transverse-traceless 3-tensor, 



9^ = 0, h™ = 0, 



IJ ' 11 

Si and Fi are transverse 3-vectors, 

diSi = diF t = , 

and other variables are 3-scalars; hereafter summation over spatial indices i,j = 1,2,3 is 
performed with Euclidean metric. Accordingly, the quadratic part of the Einstein-Hilbert 
term decomposes into tensor, vector and scalar parts, 

Leh = + + Leh, (6) 



where 



L ( Ph = \(d ^d h^-d k ^d k ^), (7) 
Leh = ^dkiSi + doF^d^Si + doFi), (8) 
Lf H = 2[d k ^d k ^-3d ^ + 2d k ( V -d B + d 2 E)d k ^]. (9) 

Likewise, the mass terms decompose as 

Lm= Ml t {^h^ (10) 

^{diFjdiFj - SiSi) (11) 

+ [{a + /% 2 + 2/3(3^ - AE)p + (a + (3){AE) 2 

-2{a + 3(3)ipAE + 3(a + 3/3)^ 2 + |^Afi] | . (12) 

Hereafter Lagrangians differing by total derivative are not distinguished. 

In General Relativity, most of the fields entering the Lagrangian L EH do not propagate: 
the only propagating degrees of freedom are conveniently parameterized by hf^, these are 
transverse traceless gravitational waves. This feature is of course a consequence of the gauge 
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invariance of General Relativity, with gauge transformations, at the linearized level about 
Minkowski background, having the form 



(13) 



where Cm( x ) are arbitrary functions of coordinates. 

Once the mass terms are added, gauge invariance is lost, and there emerge extra propagat- 
ing degrees of freedom. Indeed, one expects that massless spin-2 graviton will be promoted 
to massive spin-2 particle. While the former has two polarization states, the latter has five, 
with helicities ±2, ±1 and 0. In (3+l)-language this corresponds to two tensor modes, two 
vector modes and one scalar mode, respectively. For general a and f3, however, there is 
one more scalar mode, which is necessarily a ghost. Let us consider this issue, within the 
(3+l)-formalism first. 

In the tensor sector, hj^ remain two propagating degrees of freedom, whose mass is now 
given by 



To avoid tachyons, we consider the case a < in what follows 5 . The tensor modes have 
healthy kinetic term (7), so this sector is not problematic. 

The vector sector contains a non-dynamical field Si which enters the action without time 
derivatives. 

Let us pause at this point to discuss, in general terms, two kinds of non-dynamical fields. 
At the level of quadratic action, a non-dynamical field may enter the action either linearly 
or quadratically. An example of the latter situation is given by the vector sector of massive 
gravity: there is a term Sf in the Lagrangian (11), and also a term (dkSi) 2 in (8). In that 
case the non-dynamical field can be integrated out: the field equation obtained by varying 
this field can be used to express this field through dynamical fields (the latter enter the 
action with time derivatives), and then one gets rid of this field by plugging the resulting 
expression back into the action. The number of dynamical fields is not, generally speaking, 
reduced in this way (there are important exceptions in gauge-invariant theories, which we 
will encounter a number of times in this review) . 

Another possibility is that the action does not contain a term which is quadratic in non- 
dynamical field. This is the case, e.g., in the scalar sector of General Relativity, whose action 
(9) is linear in the field (and B, which is also a non-dynamical field, since, after integration 
by parts, it enters without time derivatives). Unlike in the quadratic case, the corresponding 
field equation is a constraint imposed on dynamical fields, and the non-dynamical field itself 
is a Lagrange multiplier. An important feature here is that the constraint reduces the number 
of dynamical fields, i.e., the number of degrees of freedom. 

5 We will see shortly that a > leads to even more severe problem in the vector sector of the model. 




= —a . 



(14) 
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This discussion is straightforwardly generalized to the case of several non-dynamical 
fields: if the part of the action, which is quadratic in these fields, is non-degenerate, all these 
fields belong to the first category, otherwise there are Lagrange multipliers whose number 
equals the degree of degeneracy. 

After this general remark, let us come back to the vector sector, and integrate out the 
field Its field equation is 

(A - m 2 G )Si = -A«9 F , 

where A is the 3-dimensional Laplacian. Expressing Si through Fj by making use of this 
equation, and plugging it back into the action, one obtains the action for the remaining field 
Fj. In massless theory, the latter action is identically zero, so that Fj does not have to obey 
any equation and thus is arbitrary. This arbitrariness is of course a consequence of the gauge 
freedom (13), in this case with transverse Q. With the mass terms added, the field Fj is 
dynamical. The Lagrangian for Fj, in 3-dimensional momentum representation, is 6 



T m l 

L F = — 



(15) 



2 Z ^oF;(p)doF(p) - P 2 i?(p)F(p) 



p z + TJIq 

To convert it into the standard form, one introduces canonically normalized field 



P 2 



F,(p) = M Pl m G J * 2 F(p) (16) 
y p 2 + m G 

and finds that the linearized action is 

Sr = J d 3 p l - [doFtdoFi - (p 2 + m G )TtT^ . 

Hence, the vector sector has two propagating degrees of freedom of mass m G (recall that Fj 
are transverse, so only two components of Fj are independent). The number of propagating 
degrees of freedom in the tensor and vector sectors corresponds to the number of helicity ±2 
and ±1 states of massive graviton, respectively, in accord with expectations. 

Let us note here that for m G = —a < the Lagrangian (15) has negative overall sign, 
so that vector modes are ghosts in that case. This is even worse than tachyon behaviour of 
tensor modes. 

Let us come back to the theory with m G = —a > 0. From (16) we see that the limit 
m G — > is singular. Fluctuations of the canonically normalized field Ti are finite in this 
limit, so fluctuations of the vector part of the metric, the field F, diverge as vn G x . At small 
but finite m G this results in the fact that the quantum theory becomes strongly coupled at 
an ultraviolet energy scale A uv which is much lower than M Pl . In the vector sector this 



6 Hereafter p denotes 3-momentum, while p is reserved for 4-momentum. 
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scale is not unacceptably low, however. We will discuss strong coupling later, as it becomes 
a problem in the scalar, rather than vector, sector. 

Let us now turn to the scalar sector, and consider first the general case 

a^O , a ^ -P , -2(3 . 

Upon integrating by parts, one obtains the following form of the Lagrangian, including mass 
terms, 

= 2 -2y?A^ - 2tpAB + 2?pAE - 3^ 2 - ipAip 
-(a + 3P)i>AE + 3 ° + 3| V + jBAB 



(17) 



In the case of General Relativity (a — (3 — 0), the fields tp and B are Lagrange multipliers, 
giving one and the same constraint ip = 0. Then the equation of motion obtained by varying 
■0 gives tp — B — E; varying E gives nothing new. There are no propagating degrees of 
freedom, while the fields B and E remain arbitrary. This is again due to the gauge freedom 
(13), now with Co 7^ and Q = d^L- 

In the massive case, the fields tp and B are no longer Lagrange multipliers, but they are 
still non-dynamical and can be integrated out. Integrating out B one obtains an additional 
term in the Lagrangian, 

L B = --M^ , (18) 
a 

while integrating out tp gives another additional term 

a + (3 

Then the Lagrangian for the remaining fields ip and E takes the form 



2i(jAE - 3ip 2 - vpA-ip + ^—^(AE) 2 - (a + 3(3)i(AE + 3 ^ ' — i? 

(20) 

Both fields tp and E are dynamical, so there are two propagating degrees of freedom in the 
scalar sector. Thus, there is an extra scalar mode over and beyond the expected helicity-0 
state of massive graviton. This degree of freedom is actually a ghost (negative sign of the 
kinetic term). 

To see this, let us concentrate on the terms with time derivatives. These come from the 
terms explicit in (20) and from the term (18). So, for given spatial momentum, the relevant 
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part of the Lagrangian has the form 

L kin = -ip 2 + BifjE 

= — (ip + —e\ 2 — —E 2 , (21) 
2 V A J 2A ' K J 

where A and B are numerical coefficients (depending on spatial momentum). One observes 
that irrespectively of the sign of A, one of the two degrees of freedom is a ghost. Of course, 
the theory is Lorentz-invariant, so both modes have u 2 = p 2 at high spatial momenta, and 
this ghost exists at arbitrarily high spatial momenta p. 

It is clear from (19) that the case a = —f3 is special 7 . This is precisely the Fierz-Pauli 
theory where 

(3 = —a = m 2 G . 

In this case there is no quadratic term in ip, so if is a Lagrange multiplier. The corresponding 
constraint is 

AE = 3*0 - -Atf). (22) 

P 

This constraint kills one degree of freedom out of two, so the only mode in the scalar sector 
is the helicity-0 state of massive graviton with normal (positive) sign of the kinetic term. 
Indeed, inserting (22) back into the action 8 (17) and adding the term (18) one finds that the 
only remaining degree of freedom is ip, and the Lagrangian has the kinetic term 

Lkin,4> = 6ip 2 - 

In fact, the complete quadratic Lagrangian for ip is 

L^ FP = 6 (d^ij - m 2 G ij 2 ) , (23) 

in full analogy with, e.g., the Lagrangian for tensor modes /i^ T . 

To end up this discussion, we make the following comment. Of course, in the case of 
Minkowski background and Lorentz-invariant mass terms, the analysis is most straightfor- 
wardly performed in Lorentz-covariant way. The (3+l)-formalism utilized here certainly 
looks as unnecessary complication. Our analysis, however, does provide useful insights. 



7 The cases a = and a = —2(3 are special too. For a = —2/3 the ghost has the same mass as the graviton, 
and because of this degeneracy, its wave function grows in time, i.e., behaves like t ■ exp(iu;t). For a — the 
graviton mass (14) is zero, and one can check that the only degrees of freedom of the linearized theory are 
transverse traceless massless gravitons. So, the theory with a = cannot be considered as massive gravity, 
and we will not discuss this case further. 

8 One may question whether this procedure is legitimate, since one of the field equations is apparently 
lost. This is not the case: in the original formulation, this would-be-lost equation is an equation determining 
the Lagrange multiplier ip in terms of ip. 
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First, it suggests that the problems of massive gravity are most severe in the scalar sector. 
Second, it shows that the Fierz-Pauli miracle occurs due to the special property of the field 
if, which acts as the Lagrange multiplier and kills the undesirable degree of freedom. We 
will see in Section 2.6 that this property is lost in curved backgrounds, and the extra degree 
of freedom — Boulware-Deser mode — reappears in the spectrum. Finally, our discussion 
suggests a possibility that Lorentz- violating mass terms may give rise to a healthier theory, 
provided they are chosen in such a way that the unwanted degree of freedom is killed in a 
consistent way. We will discuss this possibility in the following Sections. 

2.2 Ghost via Stiickelberg trick 

A convenient way to single out and study dangerous degrees of freedom is to make use of 
the Stiickelberg formalism [46]. The idea is to enlarge the field content of massive gravity 
in such a way that the gauge invariance is restored, and then make a judicious choice of 
gauge fixing condition. We will use this trick in various Sections of this review, and here 
we illustrate the Stiickelberg approach by re-obtaining the ghost in the spectrum about 
Minkowski background. 

Let us again consider the theory (2) with general mass terms. In linearized theory, one 
introduces a new, Stiickelberg field £ M and a new field by writing 

V = V + d^ u (x) + (x) . (24) 

Then the linearized theory is invariant under gauge transformations 

Importantly, because of the gauge invariance of General Relativity, the Einstein-Hilbert part 
of the quadratic action, Eq. (3), is independent of £ M , 

r(2) _ r(2) ( T \ 

Note that we did not introduce new degrees of freedom: by imposing the gauge condition 
^ = we get back to the original massive gravity. The trick is to impose a gauge condition 
on h^ u instead, and do so in such a way that all independent components of h^ v obtain 
non-trivial kinetic terms from the Einstein-Hilbert Lagrangian. This guarantees that the 
fields and £ M will decouple at high energies (for a ^ —13), and the properties of the 
dangerous modes will be read off from the Lagrangian involving the fields £ M only. We will 
further comment on this procedure later on, see Eqs. (33), (34). 

We are interested in relatively high energies and spatial momenta, u 2 , p 2 3> \a\, \(3\. We 
will thus keep the terms in the action which are of the highest order in derivatives. Because 
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of the structure of (24), these terms come not only from the Einstein-Hilbert part of the 
action but also from the mass terms. This is a peculiarity of the Stiickelberg formalism. 

The choice of the gauge condition for is not very important. One may think of the 
gauge Jiqq = 0, hoi = or of covariant gauges 9 . In either case, the remaining components 
of h^y have non-degenerate terms with two time derivatives, which come from the Einstein- 
Hilbert action. As an example, in the gauge froo = 0, hoi = 0, i.e., <p — 0, Si — 0, B — 0, 
the fields F iy ip and E have non-degenerate terms with two time derivatives, see (8) and (9). 
We will sometimes write, schematically, 

L% = {dhf. 

Kinetic terms for the field £ M come from the mass terms in the action (2). They have the 
following form, 

f(^) 2 +(f + P)W) 2 - (25) 

The mass terms induce also mixing between £ M and h^, but as we will discuss shortly, this 
mixing is unimportant for a ^ —{3 at high momenta and frequencies, p 2 , oj 2 ^> \a\, \/3\. Once 
this mixing is neglected, the fields and £ M decouple, as promised, so we can study the 
metric and Stiickelberg sectors, h^ u and independently. 

In the metric sector, the kinetic part of the Lagrangian is just the gauge fixed Einstein- 
Hilbert Lagrangian. Thus, the only propagating modes in this sector are hj^ ■ Other prop- 
agating modes belong to the Stiickelberg sector. Once mixing between and £ M is taken 
into account, the latter modes have no n- vanishing h^, but this effect is small and can be 
neglected. Let us see explicitly how this works in the gauge h 00 = 0, h 0i = 0. Consider, 
as an example, vector sector in (3+l)-decomposition language. The full Lagrangian in this 
sector is 

= ^d Fif + |(^F,) 2 - adiF 3 ■ d£j - f (cUf) 2 , (26) 

where we set Si = according to our gauge choice and £j is 3-dimensionally transverse, 
di£j = 0. The first term in (26) is the Einstein-Hilbert term (8), the last term comes from 
(25), while the third term is precisely the term that mixes metric and the Stiickelberg field. 
The field equations are 

h + < T - aF t = 0, (27) 
□C T + AF t = 0. (28) 

At to 2 , p 2 3> m 2 G = —a these equations may be solved perturbatively in a. To the zeroth 
order, the first of these equations has no oscillating solutions, so Fi = and the only 

9 Unlikc non-covariant gauges, covariant ones do not fix the gauge completely. There remain unphysical 
modes to worry about. 
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propagating modes are £[, as expected. These are helicity-1 states of massive graviton in 
the Stiickelberg picture. To the first order, one has from (27) 

f = —e 

so eq. (28) becomes 

2 

- p Ki + —j-ti = o. 

As promised, the second term here, which appears due to mixing between Fi and £[, is 
negligible for u 2 , p 2 3> tuq. 

The lesson from this exercise is twofold. First, it shows that neglecting the metric sector 
h^y is indeed legitimate (except for helicity-2 states) as long as modes with u 2 , p 2 ^> itlq 
are considered. Second, we see that the Stiickelberg formalism is useless for studying modes 
with uj 2 < rriQ] off hand, one might even loose some modes with uj 2 < m 2 G by considering 
the Stiickelberg sector only. Indeed, there is no guarantee that the system of equations like 
(27), (28) does not have more slowly oscillating solutions than equation O^f = 0; after all, 
equations (27), (28) are both second order in time. In the theory considered in this Section, 
the number of modes with high and low u is the same by Lorentz-invariance, but the last 
remark should be kept in mind when studying Lorentz-violating massive gravities. 

Let us come back to modes with u 2 , p 2 ^> itlq and consider the Stiickelberg sector. One 
may view the expression (25) as the generic Lorentz-invariant Lagrangian for the vector field 
£ M . It is well known that this Lagrangian has a ghost in the spectrum unless the two terms 
combine into field strength tensor F 2 V = (d^ u — d u ^) 2 . This occurs when a = —f3. Thus, 
we again see that the no-ghost situation is possible in the Fierz-Pauli case only. 

Ghosts are unacceptable in Lorentz-invariant theory. Therefore, we concentrate on the 
Fierz-Pauli theory in the rest of this Section. 

Let us see how the Stiickelberg analysis applies to the Fierz-Pauli case, (5 = —a = m^. 
In that case, the relevant part of the mass term reads 

77? 2 

-m 2 G (d u eK-d^K), (29) 

where we have omitted terms without derivatives but kept the kinetic mixing between h^ v 
and The 4-dimensionally transverse part of that obeys d^ r = 0, has healthy kinetic 
term given by the first line in (29). On the other hand, the longitudinal part, 

has the kinetic term only due to mixing with the field h^ v ; this is why the mixing term, which 
is subdominant at a ^ — (3, plays a key role now. Let us not impose the gauge condition on 
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h^y for the time being. Then the kinetic term for and is given by 

,2 



4*(V) - ^{d&hr - d^K)<t>. (so) 

It can be diagonalized [46] by noticing that the combination (p^d v hl xv — d^d^h^) is propor- 
tional to the linearized Riemann curvature, so the second term in (30) has the structure 
m^Rih^y) • (f). Hence, the kinetic term is diagonalized by a conformal transformation, which 
at the linearized level reads 



m 2 G 



Then the kinetic term becomes 



L km = 4h(V) + \rn%d^dy. (32) 

Upon gauge fixing of h^ v , the longitudinal sector of the theory contains one degree of freedom 
<f> with healthy kinetic term. In this way one recovers the absence of ghosts in the Fierz-Pauli 
theory. 

Note that for general a ^ —(3, the term (32) is subdominant as compared to the term 
(a + (3)(D(J)) 2 that arises from (25). Thus, mixing between scalar parts of h^ v and £ M is 
unimportant in that case, just like in the vector sector. 

A general comment is in order. It has to do with the fact that the Stiickelberg proce- 
dure, with gauge conditions imposed on h^, may introduce spurious solutions to the field 
equations. Consider, as an example, the gauge hoo = 0, ho% = 0. With this gauge choice, one 
has /loo = 2<9o£o, h 0i = <9o£j + d^o- Varying the action of the original theory with respect to 
h^ v , one obtains the field equations 

SS SS SS 

= 0, ^ = 0, — = 0. (33) 



Shoo Sh 0i Shij 

The first two of these equations do not contain second time derivatives; these are constraints. 
On the other hand, substituting h o = 2<9o£o, hoi = <9o£i + <9i£o into the action and varying 
then with respect to £ M and hij one finds 

^° (shoo) + ^ (shoi) °' ^° i^Shoi) °' Sh^ ° 

There are no constraints any longer; instead, all these equations are second order in time. 
Hence, the system (34) has more solutions than (33). However, we are interested in propagat- 
ing modes, i.e., solutions to the linearized field equations that have the form exp(iu;t — ipx). 
In this case the left hand sides of (33) oscillate, unless they are identically zero, so the left 
hand sides of (34) cannot vanish unless (33) are satisfied. The system (34) has the same 
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number of propagating modes as the original system (33). Also, energies and momenta of 
the solutions are the same in the original and Stiickelberg formalisms: if a propagating mode 
is a ghost in the Stiickelberg formalism, it is a ghost in the original theory. Indeed, in gen- 
eral there is unique (modulo terms which do not contribute to total energy and momentum) 
energy-momentum tensor that is conserved on solutions of the field equations. The latter 
observation is valid also at non-linear level, and for adiabatically varying backgrounds. 



2.3 Van Dam— Veltman— Zakharov discontinuity 

The Fierz-Pauli mass term changes the gravitational interaction both between two massive 
bodies and between a massive body and light. This interaction is straightforward to calculate 
in the weak-field approximation [50, 51]. The result is surprising: the prediction for the light 
bending in the massive case is different from General Relativity even in the limit of zero 
graviton mass. This is known as the van Dam-Veltman-Zakharov (vDVZ) discontinuity: the 
linearized Fierz-Pauli theory does not approach linearized General Relativity as vtiq — > 0. 
Taken at face value, this result would mean that the Fierz-Pauli gravity is ruled out, as the 
experimental measurement of the light bending agrees with the General Relativity (see, e.g., 
[52] and references therein). 

Let us consider this phenomenon in more detail. At the linearized level, the interaction 
between two sources of the gravitational field is given by 

GT^P, uXp T^, 

where G is the gravitational coupling constant, P^ u \ p is the propagator of the gravitational 
field and T^ u and T' xp are the energy-momentum tensors of the two sources. The point is 
that the propagators are different in the massive and massless cases. Their structure in both 
cases is 

p X/i e fiu e \p 

p z — m G 

where e l are the graviton polarization tensors; in the massless case the mass in denominator 
is zero. Since the denominator is continuous in the limit of zero mass, it is the sum over the 
polarizations which is responsible for the discontinuity. 

In the massive case there are 5 polarization tensors. The summation over these tensors 
gives 

FP: PpuXp = 2 1 2 I 7}VnxVup + \vppVu\ - \v^V\ P + Go-dependent terms) 1, (35) 

P L J 

where the terms containing p p are irrelevant because they do not contribute when contracted 
with the conserved energy-momentum tensors. In the massless case there are only two 
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polarizations. The propagator in this case takes the form 

GR: P^xp = ^^VnWup + ^VupVuX - ^V^Vxp + (p-dependent terms) |. (36) 

The difference between these two expressions is in the coefficient of the third term. This 
difference persists in the limit of zero mass; this is precisely the vDVZ discontinuity. Note 
also that the difference is in the part of the propagator which couples to the trace of the 
energy- momentum tensor 10 . 

It is worth noting that the vDVZ discontinuity is specific to spin-2 field. In the case of 
vector field, the zero mass limit of the propagator coincides, modulo longitudinal piece, with 
the propagator of massless field, so the vDVZ discontinuity is absent. 

In general, the interaction constants Ggr and Gfp in the massless and massive cases 
are different. The relation between them can be found by requiring that two non-relativistic 
bodies interact with the same strength in the massive and massless theories. In the non- 
relativistic limit only the 00-component of the energy-momentum tensor contributes, so that 
one has 

GR: GqrT^v Pp,v\ P T' x = -GgrT qTq — , 

' A p z 

2 ,1 
FP: GfpTp,v T\ p — -GfpT 00 T Q0 — 



3 11 vjvj uU 



G 



This implies for the zero-mass limit 



3 3 

Gfp = -^G GR = -^G Newton . (37) 

Consider now the prediction for the light bending in both cases. The energy-momentum 
tensor of the electromagnetic wave is traceless. Thus, the third term in the propagator does 
not contribute, and one finds the following expressions for the interaction strength: 

GR: GgrTooTq Q — , 
pi 

FP: G fpTqqTqq— k-. 

p z — m G 



10 One may ask whether one can get around the vDVZ discontinuity by abandoning the weak equivalence 
principle, i.e., by modifying the way in which gravity couples to matter. Indeed, in massive gravity, the 
consistency of the field equations does not require the covariant conservation of the source tensor (unlike in 
General Relativity, where the gravitational part of the field equations — the Einstein tensor — obeys the 
Bianchi identity so the matter part — energy-momentum tensor — must obey the covariant conservation 
law). If, instead of the coupling to the conserved energy momentum tensor, the field couples to some 
tensor S^ u whose divergence is non-zero at finite ttlq and vanishes in the massless limit only the above 
analysis does not go through. This question has been studied in Ref. [53]; the result is that one cannot get 
rid of the vDVZ discontinuity in this way. 
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In view of eq. (37), the light bending predicted in the massive theory in the limit of the 
vanishing graviton mass is 3/4 of that in the massless theory, General Relativity. 

Clearly, the discontinuity is related to the longitudinal polarizations of the graviton, i.e., 
to the Stiickelberg field ^ discussed in the preceding Section. In what follows we will shed 
more light on the mechanism responsible for this phenomenon. 



2.4 Vainshtein radius 

As we already noticed, if the arguments of the previous Section were strictly correct, they 
would imply that the mass of the graviton is exactly zero in the Lorentz-invariant the- 
ory. These arguments, however, have a loophole [54] as they rely on the linear approxima- 
tion. In General Relativity this approximation is valid for distances much larger than the 
Schwarzschild radius of the source. Therefore, the gravitational bending of light that passes 
next to the surface of the Sun is well described in the linear regime. 

The situation is different in the Fierz-Pauli gravity. It has been argued in Ref. [54] by 
studying spherically symmetric classical solutions, that at non-zero graviton mass the linear 
approximation actually breaks down already at distance much larger than the Schwarzschild 
radius, namely at the distance called the Vainshtein radius, 

( M \ 1/5 

r ^te) • (38) 

where M is the mass of the source. Note that the smaller the graviton mass, the larger the 
distance where the non-linear regime sets in. Taking the graviton mass to be of order of the 
present Hubble parameter one finds r v ~ 100 kpc for the Sun, so that bodies orbiting the 
Sun, as well as light passing not far from the solar surface feel the non-linear gravitational 
interaction. The above argument for the wrong bending of light in massive theory is, there- 
fore, not directly applicable. On the other hand, non-linearity of the Fierz-Pauli gravity in 
the entire solar system is a problem by itself. 

The origin of the scale ry is easy to understand by simple power counting [46] . Let us 
recall first how the Schwarzschild radius r$ = 2M/Mp { appears as the expansion parameter 
in General Relativity. Schematically, the quadratic Einstein-Hilbert action (3) with source 
term reads 

d 4 x [M^{dh) 2 +Th] , (39) 



/ 



where h stands for the metric perturbation and T for the energy-momentum tensor. The 
corresponding equations have the solution 11 

h = 1 T 



d 2 Ml l 



11 Throughout this review, when presenting power-counting arguments we ignore numerical factors and 
signs. 
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or, equivalently, 

where M is the total mass of the source. This is the standard Newton's law. The non-linear 
corrections to the action begin with the terms of the type Mp t J d 4 x h(dh) 2 . Requiring that 
these terms are small compared to the quadratic contributions (39) one obtains the condition 
h <C 1, i.e. for the perturbation (40), 

M 

< 1 . 



M 2 x r 



This is precisely the condition r ^> r$ ensuring the validity of the linear approximation in 
General Relativity. 

In the case of the Fierz-Pauli massive gravity this condition has to be satisfied as well. 
However, there is a stronger constraint. In the Stiickelberg language of Section 2.2, this 
constraint comes from the analysis of the field Of particular importance is its scalar part 
£m = d^c/). It follows from the discussion in the end of Section 2.2 that the action for the 
fields h^ v and in the presence of conserved source has schematically the following form 



d 4 x 



M^(dh) 2 + M^m G (d<f)) 2 + Th + m 2 G T(f) +...], (41) 



where we again omitted numerical coefficients and did not write explicitly the mass terms for 
h^ v and <p. The kinetic term here is the same as in (32), while the source term is obtained from 
the standard expression h^T^ by making use of (24) and (31), together with the linearized 
conservation law d^T^ = 0. Note that after the diagonalization of the kinetic term via 
(31), there appears direct interaction of matter with the field <fi. By solving the equations 
of motion we find that at distances much smaller than m G x the gravitational potential h is 
given by eq. (40) and that m G (f) is of the same order, 

=U M 
m G <p = —py—. 

G M 2 f 

The latter formula implies that itself is large and singular in the limit m G — * 0, cf. Section 
2.1, Eq. (16). This is the origin of the non-linearity at large distances from the source. 

Indeed, a non-linear generalization of the Fierz-Pauli mass term would contain higher 
powers of the perturbation h^ v . The lowest term of this type is just h 3 . This term gives rise 
to the non-linear contribution to the action of the form 

J d 4 x M^m G (d 2 <i)) 3 . (42) 
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Another source of terms of the same order is the non-linearity of gauge transformations in 
General Relativity. In general, the coordinate transformation x M — ■> x^ + ^ corresponds to 
the following gauge transformation of metric, 

9A X ) = 9„u(x + C)+d^ x g uX (x + C)+dX X g^ix + O+d^ d u ( p gx P (x + () . (43) 

Writing g^ v = r\^ v + h^ u , one obtains at quadratic order (in both h^ v and £ M ) 

where indices are still raised and lowered with Minkowski metric. Accordingly, the change 
of variables (24) at this order has the form 

+ d^d^x + d^Kx + d^Kx (44) 

The field £ M still does not enter the Einstein-Hilbert part of the action, while the mass term 
receives the contribution, whose schematic form is 



/ 



d 4 x M 2 Pl m 2 G (d0 3 (45) 



that is, the contribution of the form (42) for £ M = d^<f). 

Linearized theory is valid when the contribution (42) is smaller than the quadratic term. 
This requirement leads to the condition 

M 



«1, 



which is equivalent to r < with ry given by eq. (38). 

The situation may be improved by tuning the explicit h 3 terms in the non-linearly gen- 
eralized Fierz-Pauli action in such a way that the leading correction (<9 2 0) 3 is absent. In 
this way the onset of the non-linear regime may be pushed to smaller scales, namely, to the 
distance 



M ^ 1/3 



(46) 



One can show that the situation cannot be improved further [46]. For the graviton mass of 
the order of the present Hubble parameter 12 the non-linear regime occurs at distances below 
r* ~ 10 pc from the Sun, which still covers the whole solar system. 



12 Hereafter we assume that greater values of vtiq would be inconsistent with cosmology. Even without 
this assumption, the requirement that Newton's law remains valid at the scale of galaxy clusters would give 
ma < 10 Mpc -1 . The estimates here and in the rest of this Section would change by about two orders of 
magnitude, with no change of conclusions. 
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Let us make contact of the analysis presented here with the study of the vDVZ discon- 
tinuity. It is clear from (41) that the gravitational field h^ v that interacts with matter, is 
an admixture of the two fields h^ v and m 2 G 7]^ v (j). The field h^ v has the same kinetic term 
L>EH{h[j,v) as the linearized gravitational field in General Relativity, while m^cf) has the ki- 
netic term of a gravi-scalar. Both fields interact with matter at roughly the same strength. 
Keeping the part only, one would obtain, in the massless limit, the propagator of h^ v 
(and hence the propagator of the full metric perturbation modulo longitudinal terms 
omitted in (35) and (36)) which has precisely the form of the propagator in the linearized 
General Relativity. The field adds extra trace piece into the propagator, which does 
not vanish in the massless limit and sums up with the contribution of to the propagator 
(35). 

To summarize, distances below which massive gravity is in non-linear regime are not less 
than given by (46) and hence are very large. One may hope that the non-linear interactions 
would modify the theory in such a way as to make the massless limit smooth [54]. This 
indeed happens in some cases, an example being the DGP model [22] where the non-linear 
interactions affect mainly the gravi-scalar sector and essentially decouple it from other modes 
in the small mass limit, eliminating the extra contribution to the propagator responsible for 
the vDVZ discontinuity. This mechanism, however, does not work in the case of the Fierz- 
Pauli theory [55] . Thus, the Fierz-Pauli gravity about Minkowski background is problematic 
already at the classical level: it most likely contradicts precision tests of General Relativity. 
It becomes even more problematic at the quantum level, as we will discuss below. 

2.5 Strong coupling 

At the quantum level the above non-linearity problem manifests itself as strong coupling at 
the energy scale which is much lower than the naive expectation. 

Both massless and massive gravity should be treated as low energy effective theories valid 
at energies (more precisely, momentum transfers) below some ultra-violet (UV) scale h-uv- 
Above this scale, these theories are meant to be extended into some "fundamental" theories 
(UV-comp let ions) with better UV behavior. The situation here is analogous to the theory 
of self-interacting massive vector field, whose possible UV-completion is non-Abelian gauge 
theory with the Higgs mechanism. In the case of General Relativity, the UV-completion is 
most likely the string/M-theory; whether there exists a UV-completion of massive gravity is 
not known (in fact, the discussion in this Section suggests that the Lorentz- invariant massive 
gravity does not have UV-completion at all). 

One can trust the effective theories only at distances r 3> A^~; at shorter distances one 
has to deal with putative UV-complete theory which most likely has quite different properties. 
Experimentally, Newtonian gravity has been tested down to sub-millimeter distances [56, 57, 
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58, 59], so an effective low energy theory should be valid down to those distances. This implies 

A^<10- 2 cm, or A uv > 1(T 3 eV. 

On the other hand, an upper limit on the UV energy scale Auv can be obtained within the 
low energy effective theory itself. The tree level scattering amplitudes, calculated in the low 
energy theory, grow with the center-of-mass energy E and eventually, at some energy E ~ A, 
become large and hit the unitarity limit. Above this energy A the effective theory becomes 
strongly coupled, and hence cannot be trusted. The entire framework makes sense provided 
that the UV-completion scale is below the strong coupling scale, 

Auv < A. (47) 

A well known illustration is again non-Abelian gauge theory with the Higgs mechanism. The 
strong coupling scale of its effective low energy theory — the theory of massive self- interacting 
vector field — is equal to 

A = mv/g , (48) 

where my is the vector boson mass and g is the gauge coupling. This low energy theory 
is extended into its UV-completion — the gauge theory in the Higgs phase — at the scale 
Ajjv — mH, where m# is the Higgs boson mass: at the latter scale new degrees of freedom, 
the Higgs bosons, show up. Since A = my / g = v, where v is the Higgs vacuum expectation 
value, and m H = V\v, where A < 1 is the Higgs self-coupling, the inequality (47) indeed 
holds in this example. 

In General Relativity, the strong coupling scale is A = M Ph so its UV-completion may 
occur well above accessible energies 13 . In massive gravity, the strong coupling scale A, and 
hence the UV-completion scale Auv, are certainly much below Mpi. Naively, one would 
estimate the strong coupling scale as follows, 

A ~ (M Pl m G ) 1/2 . (49) 

This is a direct analog of (48). Indeed, consider the transverse component of the Stiickelberg 
field £ M which obeys d^ r = 0. The kinetic term in the Lagrangian for this component comes 
from the Fierz-Pauli mass term and schematically has the form 

L<? = M Pl m 2 G (dtitr) 2 . (50) 

The terms, which are cubic in and come from a non-linear generalization of the Fierz- 
Pauli term, as well as from the non-linear change of variables (44), give rise to the interaction 

13 This is not a necessity: the inequality in (47) may be strong. In this regard, an interesting possibility 
is TeV-scale gravities, where Auv is of order of a few TeV, and new phenomena occur at collider energies. 
This possibility is reviewed, e.g., in Rcfs. [13, 60, 61, 62]. 



23 



terms schematically written in (45). Had the form (50) been common to the kinetic terms 
for both transverse and longitudinal components of one would introduce canonically 
normalized field x M = Mpimo^ and find that it enters the kinetic term with unit coefficient, 
while the interaction Lagrangian is 

L >- = M^rJ dx)3 ' (51) 

This theory would indeed have the strong coupling scale (49), since it is this parameter that 
suppresses the higher order operator (51). The analysis of other higher order operators, 
(d£) 4 , etc., would lead to the same conclusion. The same argument applied to theories of 
self- interacting vector fields leads to the estimate (48), hence the analogy between (49) and 
(48). 

The scale (49) is actually quite interesting phenomenologically. For the graviton mass of 
order of the present Hubble parameter, the corresponding distance is 

(M Pl m G )- 1/2 ~ 0.05 mm. (52) 

Were this the true scale of the UV-completion, one would expect novel phenomena in the 
gravitational sector at sub-millimeter distances. In the Fierz-Pauli theory, however, the 
strong coupling scale is actually much lower than the estimate (49), and the corresponding 
distance is much greater than (52). 

The problem occurs in the longitudinal sector, where £ M = d^cf). Let us recall Eqs. (41) 
and (42). According to (41), the canonically normalized field is x — Mpim G (f), and from (42) 
we see that its self-interaction has the form 

The scale that suppresses this higher order operator is now 

A = (Mpim 4 ) 1 / 5 . (53) 

This is the actual strong coupling scale in generic Fierz-Pauli theory. For the graviton mass 
of order the present Hubble parameter this scale is of order 10 -21 eV ~ (10 16 cm)" 1 , which 
is clearly too low. 

In the framework of perturbation theory, the origin of this strong coupling is the growth 
of the propagator and the wave functions of the longitudinal components of massive graviton 
with energy, much in common with the case of massive non-Abelian vector field. Consider 
the amplitude of the four-graviton scattering represented by the diagrams of Fig. 1. The 
external lines of the diagrams for longitudinal gravitons behave like E 2 /m 2 G . The 4- vertex 
gives the factor E 2 /Mp V Therefore, the first of the diagrams gives the contribution of order 

£10 
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a 



b 



Figure 1: Four-graviton scattering in the first two orders of the perturbation theory (panels 
a and b, respectively). Only s-channel diagram is shown in the second order, panel b. 

The second diagram is of the same order, since two leading contributions in the propagator 
cancel out in the on-shell amplitude [63]. Thus, the scattering amplitude indeed becomes 
large at energies of order (53). This has been checked by the explicit calculation of the 
amplitude [63]. 

The scale of strong coupling can be pushed to higher energies by a judicious choice of 
the interaction terms. Indeed, one is able to choose a non-linear extension of the Fierz-Pauli 
theory in such a way that the cubic terms (<9 2 0) 3 vanish. Then the fourth-order terms are 

L mt = M 2 Pl m 2 G (d 2 <f>y = -T7^{d\Y 

PI G 

and the strong coupling scale is given by 

A = (Mp^) 1 / 3 . (54) 

This is the best that can achieved [46], since there are not only self-interactions of the field 
4> (these can be canceled out by an appropriate choice of higher order terms in h^) but also 
interactions between the longitudinal component of the Stuckelberg field, £ M = <9 M 0, and the 
transverse component ^ . 

For the mass of the graviton mg of the order of the present Hubble parameter one finds 
from (54) that A ~ 3 x 10 _13 eV ~ (10 8 cm) -1 . This is also unacceptably low 14 . We conclude 
that the Fierz-Pauli theory suffers from severe strong coupling problem. 

14 This discussion refers to flat background. It may, in principle, happen that effects due to curvature push 
the strong coupling scale to higher values, as occurs, for instance, in the DGP model [22]. This does not 
happen in the Fierz-Pauli case [55]. 
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2.6 Fierz— Pauli theory in curved backgrounds: Boulware— Deser 
mode 

2.6.1 Cosmological background 

If the background is not exactly Minkowskian, the Fierz-Pauli cancellation no longer works, 
and the ghost or tachyon mode reappears in the spectrum. This mode exists for arbitrarily 
high spatial momenta, and hence it is unacceptable phenomenologically. This phenomenon 
is known as the Boulware-Deser instability [64]. Importantly, it occurs irrespectively of the 
way the Fierz-Pauli theory is generalized to curved space-time [55]. 

Let us see the appearance of the Boulware-Deser mode explicitly, by working out an 
example of the cosmological background. As we have discussed above, the scalar sector is 
the most problematic; indeed, as we will find shortly, the Boulware-Deser mode emerges 
precisely in this sector. So, we concentrate on the scalar sector in what follows. 

To warm up, we begin with General Relativity. Let the metric with perturbations have 
the following form 

ds 2 = a 2 (r])(r]^ + h^dafdx" , (55) 

where 77 is conformal time. Note that we have changed the definition of here; in previous 
Sections h^ u denoted the deviation of from background metric, while here this deviation 
is equal to a 2 h^ v . In what follows we will raise and lower indices by Minkowski metric, so 
that by definition h v = i] uX h^x. Another convention is that summation over spatial indices 
will be performed using 5ij, and we will never use covariant derivatives in explicit formulas. 
Hence, the dependence on the scale factor will be always explicit. 

The linearized gauge transformations (13) are generalized by making use of (43). Ac- 
cording to our conventions, we define = i] fiu ( u and write the gauge transformations in this 
background as follows, 

- spatial, Q = -di( L : 

5B = C L , 5E = Cl (56) 

- temporal, Co : 

5B = ( , 6<p = $ s + HCo, <ty = WCo, (57) 

where we use the notations (5) and specify to the scalar sector. Hereafter prime denotes 
d/drj and 

a 

To consider expanding Universe, we introduce positive cosmological constant, the corre- 
sponding term in the action being 

S A = -6H 2 M^ / dx y/=jj. 
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Here the constant H$ is, by the Einstein equations, the Hubble parameter of the de Sitter 
space in the theory without graviton mass and without matter. The background equations 
are 



3H 2 a 2 



n 2 

2H' + n 2 

and their solution, the scale factor of the de Sitter space-time, is 

1 



(58) 



H V 



The quadratic part of the cosmological constant term is 



Sf = 2H 2 M 2 Pl 



d 3 xdr] a 4 



^2 _ 9 ^ + 3vAE _ 9^ 2 + + 3 {AE)2 _ 3 {diB)2 



(59) 

where A = didi. 

By appearance, the part (59) in the action resembles the graviton mass term, as it 
contains no derivatives of the fields. This term does not have the Fierz-Pauli structure, and 
both y9 2 -term and (<9j-B) 2 -term are present. Hence, unlike in Minkowski background, none 
of these fields appears to be a Lagrange multiplier, and one might suspect that the theory 
has two dynamical scalars ip and E. This does not happen in General Relativity: the two 
modes remaining after integrating out (p and B are pure gauge modes. 

Indeed, let us consider the Einstein-Hilbert and cosmological terms together. Off shell 
(that is, for arbitrary background) the quadratic part is 



S 



(2) 

EH+A 



2M 2 Pl 



d 3 xdr) a 2 



- 2 v Aip - 2^'AB + 2^'AE' + 3^" - ipAip 
+H(2<pAB - 2ipAE' + 6#') 
+ 



\ H2 + \ Hy 



y? 2 + 



- V - -H 2 a 2 
2 2 



+ {n 2 - H 2 a 2 ) (9^ - 3<pAE + \{diBf^j 
+ {pi! + U 2 - 3H 2 a 2 ) (-i/jAE - hyAE) 2 



(60) 



Now, because of the background equations of motion, the two last lines in this expression 
vanish, and the action simplifies to 



'EH+A 



2M 2 Pl 



d A x a 2 [-2 v Aip - 2^'A5 + 2^' AE' + 3^" - ipAip 

+H(2<pAB - 2cpAE' + 6^0 
+H 2 {-3^ 2 - 9^ 2 )] . 



(61) 
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As expected, B and ip are non-dynamical fields with non-degenerate quadratic term. Their 
equations of motion give 

<p=rfl/, B = ^ + E '- (62) 

The miracle is that after these expressions are substituted back into the action (60), and inte- 
gration by parts is performed, one arrives at vanishing quadratic Lagrangian, L^ H+A (ip, E) = 
0, where, again, the equations for background were used. Thus, ip and E are arbitrary func- 
tions of a; M , while (p and B are related to them via (62). These configurations are pure gauges 
of the form (56) and (57). To check that, one again uses equations for the background (in 
particular, H' = H 2 ). 

This miracle of course happens because of gauge invariance. Once gauge-invariance is 
broken explicitly by the graviton mass terms, miracles do not happen, and the Boulware- 
Deser mode appears. 

Let us introduce the mass term generalized to curved space-time, 

There is a lot of arbitrariness at this stage: general covariance is explicitly broken, and S m 
may contain various combinations of g iiv i rf' v ', g^V^u, \fg~-, etc. The discussion that follows is 
not sensitive to the particular form of the mass term; it is assumed only that it does not 
depend on the derivatives of the metric, becomes the Fierz-Pauli term in Minkowski limit, 
and, for simplicity, that it is proportional to a single mass parameter m,Q. To illustrate the 
general analysis, we will use the simplest generalization of the Fierz-Pauli mass term, 



Srn = \Ml l J d'x |-^<V P (^-^)(^p-^) + ^[^(^-^)] 2 } 



(63) 



We stress that this form will be used for illustration purposes only. 

There is a coordinate frame where 7]^ v = diag(l, — 1, — 1, — 1), and we assume that the 
background space-time is homogeneous and isotropic in this frame. Then the metric has the 
general Friedman-Robertson- Walker form (plus perturbations) in this frame, 

ds 2 = a 2 {t)[n 2 {t){\ + h m )dt 2 + 2n(t)h 0i dtdx i + (-<% + h^dx 1 dx j \. 

In this frame, the background is characterized by two metric functions, a(t) and n(t). It is 
still convenient to work with conformal time r), that is, to perform the change of variables 
dr\ = n(t)dt. In other words, we will work in the conformal frame where the background 
metric has the form (55). Consistency of the filed equations implies an equation relating 77,(77) 
and a{rf), which generically has the form n' = f(n, a)a', see Appendix A. Note, however, that 
its solution is not unique: at a given moment of time one can take n and a arbitrary. 
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Once the mass term is added, the quadratic part of the action in cosmological background 
has a very general structure, 

S ( EH +A+mG = 2M 2 Pl J d 3 xdr] a 2 [-2^A^ - + 2^'AE' + 3^" - ^A^ 

+H(2(pAB - 2(pAE' + 6</t0') 

+^cpip + fi 2 ^>AE + fi 3 ipAE] . (64) 

The terms in the last two lines have three-fold origin. First, there are terms that do not 
vanish in the limit m 2 G — > 0; these are the terms in the last line of eq. (61). Second, the 
background equations no longer coincide with eq. (58), so the last two lines in (60) do not 
vanish. Finally, there are contributions due to the mass term S m itself. We give more detailed 
treatment of the latter two contributions in Appendix A. 

There are generically no specific relations between the terms in the last two lines of 
(64). The fields (p and B are still non-dynamical, and after integrating them out one no 
longer obtains zero action for ijj and E. Instead, the action contains terms with two time 
derivatives of ip and E, some of which are explicit in (64) and some emerge after (p and B are 
integrated out (note that the terms in (64) proportional to (p and B do not contain second 
time derivatives of ip and E, so higher time derivatives of the latter fields do not appear). 
Both if) and E are dynamical fields, hence the scalar sector has two propagating modes. This 
is in contrast to the theory in Minkowski background, with a single propagating mode in the 
scalar sector. The extra mode is precisely the Boulware-Deser degree of freedom. 

Let us now specify to near-Minkowski background, 

H 2 <^m 2 G , |a-l|<l, |n-l|<l. 

Note that in this limit H coincides with the standard Hubble parameter. We will be interested 
in relatively high momenta, p 2 3> m 2 G . A detailed analysis reveals the following features. 
First, the properties of the scalar perturbations are different in the two ranges of momenta 



<g- — k. 
H 2 ' 

4 



(n) ■■ P 2 » • (65) 

Hence, the high momentum limit and Minkowski limit do not commute. We discuss the 
range (i) in Appendix A and here we briefly summarize the results. There are indeed two 
propagating modes. One of them is the Fierz-Pauli mode whose dispersion relation remains 
uj 2 = p 2 , up to small corrections. The second mode is a ghost or tachyon-ghost (tachyon 



29 



and ghost at the same time). Being ghost means that the energy is unbounded from below; 
if the mode is simultaneously tachyon, it exponentially increases in time. 

Let us now consider the range (ii), i.e., the high momentum limit. To integrate out 
non-dynamical fields, we solve equations obtained by varying the action with respect to ip 
and B. These equations are written explicitly in Appendix A, see eqs. (165) and (166). We 
need the expression for if to the leading order in derivatives and the expression for B to both 
leading and sub-leading order; the reason is that there are cancellations. The corresponding 
expressions are 

* = ^ 

H 2H 2 A A 2H 

Plugging these back into the action (64) and integrating by parts, we arrive at the action 
for dynamical fields, 

, A*2-Jn|, /A _ m 2 Ba _,_ , m\.. 2 



n 



ip'AE - ^diE'diE' + ^p(A£) 2 j . (66) 



Note that the terms with the highest derivatives, ip'AE', have canceled out. 

We now see explicitly that there are two propagating modes. We also see that their 
action (66) is singular in Minkowski limit 15 . Indeed, by comparing (64) with the Lagrangian 
in Minkowski space-time (eq. (12) with a = —f3 = —tjiq), one finds that in Minkowski limit 



2 

m B -> — — , n 2 -> -m G , (67) 

while m 2 E and m 2 , tend to zero. Thus, the first and the third terms in (66) have the coefficients 
that diverge in the Minkowski limit, in which 7i — > 0. Furthermore, at 7i 2 m^, the 
first term in (66) has the overall positive sign (because of the first relation in (67)), which 
corresponds to negative energy. This energy is unbounded from below, so there is a ghost 
or tachyon in the spectrum. We show in Appendix A within the model (63), that one of the 
modes still has the dispersion relation 

2 2 

U =p , 

while the other mode is tachyonic or non-tachyonic, depending on the relationship between 
(a — 1) and (n — 1). 



15 According to the above discussion, this is the limit in which one first sends p 2 to infinity, and only then 
approaches Minkowski space-time. 
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2.6.2 Stiickelberg treatment 

A lesson from the above analysis is that once the gauge invariance is given up, there are two 
scalar propagating modes in curved backgrounds no matter how close these backgrounds 
are to Minkowski space-time. Minkowski limit is singular, and for nearly Minkowski space- 
time, one of these modes is necessarily pathological. Another lesson is that the explicit 
analysis of this Boulware-Deser mode is rather messy. On the contrary, the Boulware-Deser 
phenomenon is relatively straightforward to see within the Stiickelberg formalism [46, 43, 
55, 65]. 

Let us consider backgrounds which are only slightly different from Minkowski space- 
time. In that case the perturbation theory in is adequate. The quadratic Lagrangian 
has been discussed in previous Sections. Generically, at the cubic order one has the following 
contributions to the Lagrangian 

m 2 G [\i{hff + \ 2 h» h vX h vX + A 3 K v h\h^\ (68) 

with Ai 5 2,3 of order one. For non-trivial background the field = g^ — rj^ has non-zero 
background part h$. To perform the Stiickelberg analysis, one changes variables in the way 
dictated by (43), 

9^(x) = g^{x + + d^ x g uX (x + £) + d^ x g^(x + £) + d^ x d u ^ g Xp {x + £), (69) 
where 

9^u{x + = rj^ + h$(x + g) + h IM/ (x + £) 

= V,u + h$(x) + h, u (x) + d x h$(x)e + d x h, u {x)e + ... 

Here and £ M are perturbations, and h^uix) is meant to be gauge fixed. As before, the 
Einstein-Hilbert action does not contain the field Concentrating on the longitudinal 
Stiickelberg field £ M = <9 M 0, and inserting the decomposition (69) into both quadratic Fierz- 
Pauli term and cubic term (68), one obtains the quadratic action for 0, 



m G 



8 G 



d^dy + Ai/ig • d^d v 4> ■ u<p + A 2 /^(n0)' 



(70) 



where we keep the Fierz-Pauli contribution (32) that is independent of h$, as well as the 
part which is proportional to the background h$ and has the largest number of derivatives. 
Omitting other terms is legitimate for studying slowly varying backgrounds and perturba- 
tions whose momenta obey u 2 , p 2 3> m 2 G . 

One point to note is that any configuration obeying D0 = solves the field equation 
following from (70). This explains why we have always found a mode with the dispersion 
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relation uo 2 = p 2 when studying the theory in cosmological background. More important 
is the fact that the Lagrangian (70) is of the fourth order in derivatives of 0, so there is a 
ghost in the spectrum. To see this explicitly and estimate the mass of the ghost, consider a 
simplified version 16 of (70), 



to; 



G 



m 2 G d^(t) + \h {c) ■ (D0) : 



This Lagrangian is equivalent to 



rn 



G 



mld^d^ + 2A h (c) ■ ~ A h (c) ■ X 2 



m 2 G 



TO" 



G 



V m G J \ m G 



where x is a new field. The first term in the last expression corresponds to the modified Fierz- 
Pauli mode [(f) + (\/m G )h^x], while the second term is the kinetic term for the Boulware- 
Deser mode x- The latter has negative sign, and hence the Boulware-Deser mode is a ghost 
(depending on the sign of Xh^ it may be a tachyon-ghost at sufficiently low momenta). The 
local value of its mass squared is of order 



TO 



m BD ~ 



G 



(71) 



This explains why the high momentum limit and Minkowski limit do not commute, as we 
have seen explicitly when studying the theory about cosmological background. 

The mass (71) diverges as the background approaches the Minkowski limit. The latter 
property might raise a hope that the Boulware-Deser instability is not so dangerous. If the 
mass of the ghost turns out to be larger than the UV scale Ajjv, one cannot trust the above 
analysis, since at energies exceeding Ajjy one nas to deal with the unknown UV-completion 
of the theory. This observation, however, does not save the Fierz-Pauli theory [55]. Indeed, 
away from an astrophysical source of mass M one has 



M 
M 2 ? 



so that 



m BD ~ 



(72) 



16 The argument below is straightforwardly generalized to the case (70). One makes use of the spatial 
Fourier representation, and writes the Lagrangian in the form Acf) 2 + B(f> 2 + C<t> 2 , where coefficients depend 
on p. One gets rid of the second time derivatives by introducing a new field \ an d finds that the resulting 
structure of the kinetic terms is Bcj) 2 + 2A\4>- This structure implies that there is a ghost, cf. (21). 
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where r* is the radius given by (46). Hence, the Boulware-Deser instability definitely occurs 
in the interval (r*A;/y) 2 r* > r > r *, in which tjibd < Auv and at the same time the linear 
approximation is valid. This interval is not empty, unless A^y > r*. Recall that the value 
of r* for the Sun is of order of 10 pc. Hence, the Fierz-Pauli theory with A^ v > cannot 
be trusted in the solar system; had the graviton Lorentz-invariant mass, one would either 
encounter rapid instabilities or have to deal with an unknown UV-completion instead of the 
effective Fierz-Pauli theory. 

To conclude, Lorentz-invariant massive gravities in four dimensions are full of pathologies. 
One way towards getting around these pathologies is to give up Lorentz invariance. 

3 Lorentz-violating theories: generalities 
3.1 Lorentz-violating mass terms 

In this and following Sections we are going to study the class of theories with Lorentz- 
violating mass terms. We assume that Minkowski space-time is a solution of the corre- 
sponding field equations, and that the Euclidean symmetry of 3-dimensional space is not 
explicitly broken in perturbation theory about this background. Then the quadratic action 
for perturbations about Minkowski background is 

= 4 2 i + S m , (73) 

where S E jj is the quadratic part of the Einstein-Hilbert term, explicitly given by (3), and 
S m is the graviton mass term. The Lagrangian of the latter is 

L m = ^[^o^oo^oo + 2mlh 0i h 0i - m^U^h^ + m\hahjj - 2mlh 00 hu]. (74) 

Here, as before, h^ u are perturbations about Minkowski metric. The Fierz-Pauli Lagrangian 
is obtained when all masses in eq. (74), except for m , are taken to be equal, 

FP : ml = , ml = ml = m\ = ml = m 2 G . 

The latter property explains the conventions used in eq. (74). In what follows we denote by 
m the overall scale of the masses mo, . . . , m 4 . 

Let us again make use of (3+l)-decomposition (5). This formalism is particularly ap- 
propriate here, as it fully takes into account 3-dimensional Euclidean invariance, the only 
symmetry which is not explicitly broken by general mass term. The Lagrangian in the tensor 
sector is the sum of the kinetic term (7) and the mass term 

2 

T _ m 2 h TT iTT 
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Hence there are two propagating tensor modes with relativistic dispersion relation 

2 2,2 

to = p + m G , 

where 

m G = m 2 (75) 
is the mass of tensor gravitons. The requirement that these modes are not tachyonic gives 

m 2 2 > 0. 

We will assume in what follows that this is the case. 

In the vector sector, the quadratic Lagrangian is the sum of the Einstein-Hilbert part 
(8) and the mass term 

Tfl 2 7T1 2 

L m , v — -^-SiSi 

A novelty here, with respect to the Fierz-Pauli case, occurs at the special value mi = 0. In 
this case, the field Si is the Lagrange multiplier, leading to the constraint F — 0. Hence, 
there is no propagating modes in the vector sector, unlike in the Fierz-Pauli theory, 

m± — : no propagating vector modes. (76) 

For mi 7^ 0, the analysis of the vector modes parallels that given in Section 2.1. For m\ > 
the vector sector contains two normal propagating modes. The canonically normalized 
propagating field is now 

^(p) = MpimJ 2 p2 2 Fi(p) 
V p 2 + mf 



with the dispersion relation 

2 

to 2 = ^f(p 2 + m\) 
m\ 

In the case m\ < and m 2 ^ 0, the modes are ghosts or tachyons at high spatial momenta, 
so we impose the restriction 

m\ > 0. 

Let us now turn to the scalar sector. The full quadratic Lagrangian is 



4 2) = 2 [d k ^d k ij - 3d ^ + 2{d kV d k ^j + d k Bd d k 4> + d AEd d k ij)} 
+ 



m o 2 m T/on\2 3(3mo — mi) , 2 
-^V + -fidiBf + 3 - ?V 



-{3m 2 3 - m 2 2 )ipAE + -(ml — ml )(AE) 2 + m^(p(3ip - AE) 



(77) 
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For general masses, there are two propagating modes, one of which is a ghost. Indeed, 
for vrt\ 7^ 0, the field B can be integrated out, giving the following contribution to the 
Lagrangian, cf. (18), 

L B = -^A^. (78) 
m\ 

The field ip can also be integrated out, and the corresponding contribution to the Lagrangian 
of the dynamical fields ip and E does not contain time derivatives. Hence, the terms with time 
derivatives in the resulting Lagrangian for ip and E again have the structure (21), implying 
that there is a ghost. Generally, the ghost exists at all spatial momenta and frequencies, so 
the observations we will make in Section 3.6 do not help. One has to get rid of the ghost 
mode. 



3.2 Eliminating the second scalar mode 

While in general the theory is not healthy, at special values of masses the ghost mode does 
not exist. This is the case, in particular, if either ip or B or both remain the Lagrange 
multiplier (s). The point is that the corresponding constraint kills the second mode in the 
scalar sector, while the remaining mode, if any, may well be normal. The two choices of the 
mass pattern that do the job are m = and m x = 0. Let us discuss them in turn. 



3.2.1 m = 

In the case mo = 0, the field ip is the Lagrange multiplier, leading to the constraint 

2Aip = m 2 4 (?>ip - AE). (79) 

Assuming that m\ ^ and m 4 ^ 0, one integrates out the field B with the result (78) 
and expresses AE in terms of ip using the constraint (79). Then ip is the only remaining 
dynamical field. The terms in its Lagrangian which are relevant at high momenta and 
frequencies, w 2 ,p 2 ^> m 2 , are 



— 2 -^) WW - — — 4— ^(^) 2 

mi mi I m; 



+ 



where the omitted terms have at most two derivatives. This Lagrangian is healthy at 
w 2 ,p 2 ^> m 2 provided that 

m\ > ml > , m\ > m\. 

We will see below, however, that this case is problematic. 

Within the class of theories with m = there are subclasses in which more conditions 
on masses are imposed. As an example, already from the above analysis it follows that the 
case m 4 = 0, the case m 4 = m\ and the case m 2 = m 3 are all special. Detailed study of 
these "boundaries" is given in Ref. [42]. 
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3.2.2 mi = 

For mi = the field B is the Lagrange multiplier. The corresponding constraint is ip — 0, 
implying ip = for propagating modes. Inserting ip = back into the action, one finds that 
there remain no terms with time derivatives, so there are no propagating modes in the scalar 
sector. The vector sector has the same property, see (76). Thus, the only propagating modes 
in the theory with mi = are tensor gravitons with mass (75). We will discuss this theory 
in detail in Section 5. 

3.2.3 m 2 = m 3 , m 4 = 

By inspection of the Lagrangian (77) one uncovers one more special case, m 2 = m 3 and at 
the same time m 4 = 0. It is now the field E, rather than the non-dynamical fields ip and B, 
that plays special role. The field AE enters the Lagrangian linearly, and the corresponding 
field equation is 

2^ + (Smj - m 2 2 )i) = 0. 

Thus, there are no high frequency modes of ip irrespectively of spatial momenta. If one is in- 
terested in high frequency modes only, one sets — 0, and after that obtains the Lagrangian 
without time derivatives. Hence, there are no propagating modes of high frequencies in this 
case. 

So, we see that there are special cases in which Lorentz-violating massive gravity does 
not contain ghosts in linearized theory about Minkowski background. Further analysis of 
numerous issues raised in Section 2, as well as other points to worry about, is conveniently 
performed in the Stuckelberg formalism. 

3.3 Symmetries vs fine tuning 

As we have seen in Section 2.6, getting rid of the second scalar mode in Minkowski back- 
ground is by itself insufficient to make the theory healthy. In the Lorentz-invariant theory, 
the absence of the second mode in Minkowski background is due to the fine-tuning relation 
a = —f3 imposed on the mass term in the Lagrangian (2). This fine-tuning is, however, de- 
stroyed in curved backgrounds, and the second, Boulware-Deser mode reappears. Likewise, 
similar fine-tuning relations are problematic in Lorentz-violating theories. Let us see this 
explicitly in the theory with m = and no other relations between the masses. It is con- 
venient to make use of the Stuckelberg formalism, and proceed in analogy to Section 2.6.2. 
At the quadratic order, the Stuckelberg part of the metric (69) that contains the derivatives 
of f is 

9p,v = ?» + V = Vnu + d^ u + + d^ x d u £ x ■ (80) 
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Let us concentrate on the terms involving the field £o- 111 Minkowski background, these come 
from the second and fifth terms in the Lagrangian (74) and read 

L m = ^ml(d Ci + diZ )(doZi + d& ) - mld CodS + • • • , 

where omitted terms contain £j only. Upon integrating the second term by parts, one observes 
that £o is n ot a dynamical field, so that there is at most one propagating degree of freedom in 
the scalar sector, the longitudinal part of £j. This is in accord with the discussion in Section 
3.2.1. 

(c) (c) 

Once the background is slightly different from Minkowski, = r]^ + hfj, the latter 
property is lost. Indeed, due to the quadratic term in (80), the mass terms themselves 
include the combination 

-\mlhnh$ = -Imlidohyhg + . . . 

The field £ becomes dynamical, the second mode reappears, and in some backgrounds 
(appropriate sign of h\f) this mode is a ghost. 

There is an elegant way out of this fine-tuning problem, however [42]. Relations between 
the masses, instead of being results of fine tuning, may be consequences of unbroken gauge 
symmetries, which are parts of the gauge symmetry of General Relativity. These residual 
gauge symmetries may then be expected to protect the theory from becoming pathological 
when one extends it to curved backgrounds and/or generalizes it to include possible UV 
effects (the latter will be discussed later on). In several cases this approach does lead to 
healthy infrared modified gravities. 

One can think of various residual gauge symmetries [42]. In this review we discuss only a 
few of them, which either are known to give rise to interesting theories or serve as examples 
of the failure of this approach. The first unbroken symmetry we are going to elaborate on is 

(i) : x i -» x i + C(x\t) . (81) 

This symmetry implies that all masses but m vanish; this is the symmetry of the ghost 
condensate theory [40]. 

The second symmetry to be discussed is 

(ii) : t ^t + ( (x\t) . (82) 

This symmetry leads to the constraint m = m\ = m± = 0. We will see that the correspond- 
ing theory has problems with the stability against UV effects, see Section 3.5. 
The third symmetry is 

(Hi) : x i -> x l + C(t) . (83) 
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This symmetry is sufficient to ensure that m\ = 0, while other masses are unconstrained. We 
have found in Section 3.2.2 that in this case the linearized theory in Minkowski background 
is free of pathologies. We will see that the corresponding theory [42, 44, 43] is healthy both 
in nearly Minkowski and in general cosmological backgrounds. It is UV-stable as well. In 
fact, as we discuss in Section 5, this theory is quite interesting from the phenomeno logical 
viewpoint. 

3.4 Lorentz-violating scalars 

A convenient way to analyze the behavior of an infrared modified gravity of the type we 
discuss in this review, and also to promote the perturbation theory about Minkowski back- 
ground to a full low energy effective theory, is to start off with a generally covariant theory 
with additional scalar fields (f) a , a = 0, 1,2,3, which we will call Goldstone fields. Breaking 
of Lorentz-invariance occurs when these fields obtain background values which depend on 
space-time coordinates. In this approach, Lorentz-invariance is broken spontaneously, as the 
original action of the theory is Lorentz- invariant, while the background is not. 
As an example, in Minkowski space-time, the background fields are 



where A is a parameter with dimension of mass, and a and b are coefficients of order one. 
In our convention, the fields <p a have dimension of mass. The background fields (84) are 
solutions to the equations of motion if the Lagrangian contains their derivatives only. The 
latter property automatically implies that the Lagrangian is invariant under shift symmetry 
4> a (x) — > 4> a (x) + \ a with constant \ a . This means that the translational symmetry of 
(3 + 1) -dimensional space-time is unbroken by the background (84), since one can undo a 
translation by shifting the fields <p a . Likewise, to preserve spatial rotation symmetry one 
requires that the Lagrangian is invariant under 5*0(3) rotations of the fields, l — > A*0 J . 
Thus, one is lead to consider theories whose actions, at the one- derivative level, have the 
general form 



0° = aAH, 
ft = bA 2 x\ 



(84) 



S — Seh + S,/), 
where Seh is the Einstein-Hilbert action, while 



(85) 




(86) 
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and 

x = ^<r^v, 

Y 13 = -^iTdrfdrP, (87) 

q = j- s ^= t ^p tl]k d^d u ^d x ^d^ k . 

Internal indices k are to be contracted in the action (86) with either 5^ or e^. Hereafter 
we will not use the convention (4) when writing the Lagrangian for the Goldstone fields; this 
will simplify power counting. The combination Q is in fact not independent (apart from 
possible subtleties related to the presence of the e-symbol): its square can be expressed in 
terms of X, V 1 and Y l K So, in what follows we only consider functions F depending on the 
former three combinations. 

The energy-momentum tensor of the configuration (84) vanishes in Minkowski space- 
time, and hence Minkowski space-time is a legitimate background, provided that a and b are 
such that 

1 „ 2 9F 

-2 F + a iX = 

1 f)F 

L FSij + tf-?fL, = 

2 J dY % 3 

dF 

w - < 88 > 

at X = a 2 , Y lj = —b 2 5 lj and V 1 = 0. In what follows we often set a = b = 1 by field 
redefinition. 

The theory with the action (86) is to be considered as an effective field theory valid at low 
energies only 17 . The UV cutoff in this theory A uv is to be somewhat below A, cf. Section 
2.5. Indeed, expanding the fields about the background (84), 

0" = 0" + vr", 

one obtains the following structure of the Lagrangian for the perturbations, 

L n = (dn) 2 + i(<97r) 3 + . . . (89) 



17 What is exactly meant by low energies becomes clear after effects of higher-order terms are understood: 
these are energies and/or momenta at which higher-order terms blow up. Since the actions we discuss are 
Lorentz- invariant, the way the UV cutoff A enters the action is dictated by Lorentz-invariance. On the other 
hand, the value of energy at which the low energy theory ceases to work may be different from the value of 
spatial momentum, due to the spontaneous Lorentz-violation by background scalar fields. 
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implying that Auv i$ A. In this regard, an important issue is the UV stability of the 
theory [42]. In low energy effective theories, there is no reason to think that the low energy 
Lagrangian contains terms with first derivatives only. So, one has to worry about the effects 
of higher-derivative terms like A -2 ^ ' g^d^d^dxdpcf) 01 . Naively, these terms are suppressed 
below the cutoff scale, i.e., at p 2 ,uj 2 <C A 2 . However, if the kinetic terms in (89) do not have 
generic structure, the higher- derivative terms may become important. We will encounter 
examples of this sort in what follows. 

Turning on gravity, still in Minkowski space-time and in background (84), one observes 
that the gauge transformation x^ — > + (^(x) corresponds to the following transformation 
of the fields n a , 

7T a (x) ^7T a (x)+A 2 C(x). 



Hence, 



= A- 2 n a (90) 



are the Stiickelberg fields of the previous Sections. In the unitary gauge, ir a = 0, one has 
X = 1 — /iqo ) = — 1 — hij, etc., so the quadratic in part of the action contains the 
mass term (74), the scale of graviton masses being 

"* = ]£• (91) 

which is in accord with (49). Hence, the class of theories (85) indeed has all expected 
properties of Lorentz-violating massive gravity. A convenient feature of this construction 
is that the behavior of the theory at p 2 , oo 2 ^> m 2 G in or near Minkowski background can 
be analyzed by studying the Goldstone sector only. Also, the theory away from Minkowski 
background is well defined. 

Needless to say, for general Lagrange function F the theory is pathological. As we 
discussed in Section 3.3, it may not be pathological if a part of the gauge symmetry of 
General Relativity remains unbroken. In that case the Goldstone action (86) does not have 
the generic form. As an example, the residual gauge invariance t — > t + C°(^*, t), see (82), in 
the Goldstone language implies that the Lagrange function F is invariant under the change 
of variables 

O -0 O + H°(0\0 ) (92) 

with arbitrary function S°(0 l , 0°). Indeed, only in this case the background (84) is invariant 
under the gauge transformation t —> t + (°(x\t) supplemented by a field redefinition. It 
is in this way that the Lagrangian, and hence graviton mass terms, get constrained by the 
requirement of residual gauge invariance in the Goldstone framework. 
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3.5 An example of UV unstable theory 



To illustrate the problem with UV stability that one may encounter in otherwise healthy the- 
ory, let us consider the model with the residual gauge symmetry (82), implying the constraint 
on the Lorentz-violating graviton masses m = m\ = m 4 = 0. In the Goldstone language, 
this symmetry translates into the field transformation (92). This can be a symmetry of the 
Goldstone action (86) only if the field 0° is absent altogether. Hence, the Goldstone sector 
of the theory has three fields <p l and at the one-derivative level the action is 

S^ = j d 4 x ^A 4 F(Y^). 

where Y^{(f) % ) is given by (87). As pointed out in Section 3.3, the general Lagrangian for 
the theory of Goldstone fields, viewed as low energy effective theory, contains higher order 
terms, e.g. 

AF = ^g^g^d^dxd^. (93) 

We will see that in the model discussed here, these terms are important and, in fact, give 
rise to pathologies in the spectrum. 

Let us consider this theory in Minkowski background, discarding the higher-order terms 
for the time being. Lorentz-invariance is broken by the background 

ft = A 2 x i (94) 

which obeys the field equation for the Goldstone fields. Expanding the fields near this 
background, ft = ft + n\ one obtains the quadratic Lagrangian for the Stiickelberg fields 
it 1 . This Lagrangian involves the first and second derivatives of the Lagrange function F 
evaluated at Y^ = Y l i(ft) = — 5^, which we parameterize as 

FiSij, 

F2iSijS k i + F 22 (SikSji + SuSjk) . 

The quadratic Lagrangian is 

L n = F^ttW + 2F 21 d i ir i d j ir j + 2F 22 (d i ir j d i ir j + drfdji?) . (95) 

At first sight, this Lagrangian describes three scalar fields with healthy kinetic term. This, 
however, is inconsistent with the fact that for mi = there are no propagating modes in the 
vector sector, see (3.1). The resolution of this discrepancy has to do with the requirement 
that the energy-momentum tensor of the background field configuration (94) vanishes, so 



dF 



d 2 F 



BY i WY kl 



(0) = 
(0) = 



41 



that Minkowski metric is a solution of the complete set of field equations. The corresponding 
conditions are read off from (88), which in the absence of the combinations X and V 1 yield 

Hence, F\ = in (95), so the one-derivative action actually corresponds to a theory with 
no propagating modes: at this level, all Stuckelberg fields enter the action without time 
derivatives, so none of them is a dynamical field. 

Once the higher order terms are added, the situation changes. The terms like (93) contain 
time derivatives, and there is no symmetry that would forbid them. In terms of the fields 
it 1 , these contributions have the following structure, 

AL n = L [(«9 V) 2 - (d d^) 2 + ...]. (96) 

These contributions dominate at high frequencies, precisely because the Lagrangian (95) does 
not contain time derivatives, i.e., precisely because the fields n l are not dynamical at the 
one-derivative level. With the higher order terms included, the fields 7i l become propagating, 
and their dispersion relation is 

lu a = const • p 2 A 2 . 

This means that at least one of the modes for each tc 1 is tachyonic, and the corresponding 
"frequency" is high even at moderate spatial momenta (being, nevertheless, smaller than the 
cut off scale A uv ). Because of that, the model is unacceptable. 

Hence, the fields that are non-dynamical in Minkowski background and at the level of 
one-derivative Lagrangian, need to be treated as suspects. They may become propagating 
in curved backgrounds and/or due to higher order terms in the Lagrangian. We will refer 
to the former possibility as the Boulware-Deser instability, while the second is called UV 
sensitivity [42]. 

To conclude the discussion of the model studied here, let us point out that in the lan- 
guage of metric perturbations, its UV sensitivity is sensitivity against derivative terms in 
the Lagrangian for h^ u . As an example, the first, most unwelcome contribution in (96), in 
terms of metric perturbations corresponds to the term 

2 



= M 2 Pl J d 4 x m 2 G ■ Q^/ioo - d h ^j 



2 



where we recall the relation (90) that implies the correspondence h ~ A 2 dn, and use 
(91). This term is invariant under residual gauge transformations (82), is suppressed by the 
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anticipated UV scale (49) as compared to the graviton mass terms, so there is no reason for it 
not to be present. In this language, what we have done was to find out that the theory with 
two non- vanishing graviton masses m 2 and m 3 has tachyons in the spectrum, once generic 
one-derivative terms in h^ v consistent with the symmetry (82) are added. 

3.6 Not-so-dangerous instabilities 

To end up this Section, let us add digress to more phenomenological discussion of insta- 
bilities in Lorentz- violating theories. In these theories one can allow for tachyons and/or 
ghosts provided they exist at low frequencies (particle energies) only. In viable theories, the 
frequency cutoff A tc for tachyons can be somewhat higher than the present Hubble scale H , 
while for ghosts the cutoff A gh can be many orders of magnitude higher than H . Let us 
discuss this issue in some detail, assuming that ghosts and tachyons interact with ordinary 
matter only gravitationally. 

Consider tachyons first, and suppose, as an example, that the dispersion relation is 



for |p| <C A tc , while for |p| > A tc the frequency is normal, u 2 > (an example of such a 
dispersion law is oo 2 = — p 2 + A t ~ 2 p 4 ). Then in expanding Universe, u scales at |p| C A ic as 



where Q is constant conformal frequency. There is a characteristic moment of time t\ in the 
history of the Universe, at which 



Before that time, would-be tachyonic modes with u(t) < A tc are over-damped and do not 
develop, so exponential growth of any mode is possible only after tA- The largest growth 
factor corresponds to modes that become tachyonic just at the time tA, i-e., modes with 



Indeed, modes of higher conformal frequency still oscillate at t = tA, while modes of lower 
conformal frequency still do not develop at t ~ tA- By now, the largest growth factor for the 
field amplitude is 



where to denotes the present time. The inhomogeneities in the tachyon field produce gravita- 
tional potentials comparable to those of ordinary matter with energy density perturbations 




(97) 



H{t A ) = A tc . 
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Figure 2: The decay of the vacuum into two ghosts and two photons via creation of a virtual 
graviton pair. 



where we estimated the pre-exponential factor on dimensional grounds and neglected redshift 
of energy when writing this factor. The bound on A tc comes from the requirement that this 
inhomogeneous energy density does not exceed observationally allowed value, say, 10~ 4 p c (the 
exact number is unimportant here). Approximating the cosmological expansion by a oc 

f 2/3 

(matter domination), we find 



5p » A 4 C ■ exp (3tJ /3 # 3 A 
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Demanding that 5p < l(T 4 p c ~ 10~ 4 M^#2 we fi n d 
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We conclude that the frequency cutoff for tachyons with dispersion relation (97) must be of 
order A tc ~ 10 5 H or lower. 

The bound on A tc rather strongly depends on the form of the dispersion relation for 
tachyons. In any case, it is somewhat higher, but not very much higher, than Hq. 

Let us now turn to ghosts. The instability in this case is due to pair creation of ghosts 
plus usual particles from vacuum, the process allowed by energy-momentum conservation 
due to negative energy of ghost particles. The strongest bound [66] on the frequency cutoff 
A g h comes from the process 

vacuum ^0 + + 7 + 7, (98) 

where and 7 denote ghost and photon, respectively. We are assuming that ghosts experi- 
ence gravitational interactions only. Then this process is described by the diagram of Fig. 2 
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and its rate per unit volume is estimated on dimensional grounds as 18 

A 8 

r ~ 9h 



M 4 Pl 



Note that all particles in (98) are on-shell, so A gh is the cutoff on energy, not the usual 
UV cutoff on momentum transfer. In Lorentz- invariant theories one has A gh = oo, and the 
rate is infinite. This corresponds to infinite volume of the Lorentz group. In other words, 
in Lorentz-invariant theories, the process (98) with certain momenta of outgoing particles 
has its boosted counterparts, so the phase space is infinite. This is not the case in Lorentz- 
violating theories. In the latter, photons created in the process (98) have energies < A g h, 
and their number density in the present Universe, and hence the flux near the Earth, is of 
order 

F ~ Tt Q . 



The flux per energy interval is 



This flux has to be smaller than the EGRET differential flux, 

= 7- 10 9 T E :, ^ ) ^cm-W^MeV" 1 . 



dE y V 450 MeV 



This requirement gives [66] 

V<3MrV. 



Hence, the frequency cutoff in Lorentz- violating theories with ghosts may be relatively high. 



4 Ghost condensate: modification of gravity without 
graviton mass 

Consider now an example of the UV-stable theory, namely, the model of the "ghost con- 
densate" [40]. The issues discussed above — the absence of the extra scalar mode near 
Minkowski space-time, protection by a residual symmetry against its reappearance in curved 
backgrounds and due to higher- derivative corrections — play a key role in the construction of 
this model. Although the graviton remains massless in the ghost condensate model, the sim- 
plicity of this model makes it a good introduction into more complicated models of massive 
gravity. 

18 We assume here that 3-momcntum cutoff is aiso of order A g h- 
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As we discussed in Section 3.2, a convenient way to modify the gravitational interaction 
in the infrared is to introduce additional scalar fields. In the simplest case this is just a single 
scalar field (cf. Section 3.4) with the action reminiscent of eq. (86), 

= A 4 J d A x^F(X) (99) 

with X given by the first of eqs. (87), X = g^d^d^/ A 4 . The field equation derived from 
this action is 

—8, [FxWy/^gTW] = , (100) 

V-g 

where Fx = dF/dX. In Minkowski space-time, and with gravity switched off, this equation 
has solution linearly growing in time ( "ghost condensate" ) 

4> v = aA 2 t , (101) 

where a is an arbitrary constant. This background obviously breaks the Lorentz symmetry. 
The time translations are also broken, but the diagonal combination of the time translations 
and shifts of by a constant remain a symmetry, so there is a conserved energy. In the 
"unitary" gauge = (fi v the variable X reduces to X = a 2 g 00 and the action (99) becomes 
the function of g 00 . This action is invariant under the space-time dependent transformations 
of spatial coordinates 

x i ^x l = x\x\t), (102) 

as discussed in the end of Section 3.3, see eq. (81). This symmetry plays an important role 
in the construction of the ghost condensate model. 

Once the back reaction of ghost condensate on gravitational background is switched on, 
the parameter a is no longer arbitrary. In order that Minkowski space be a solution to the 
Einstein equations for = <p v , the energy-momentum tensor of <f> v has to vanish, 

T, u = [2d„<f>d u <f>F x - V,uF}^ v = 0. 

This equation leads to the following two conditions (cf. (88)), 

2a 2 F x (a 2 ) - F (a 2 ) = 0, 

F(a 2 ) = 0. (103) 

The second of these two conditions is the usual tuning of the cosmological constant to zero. 
When this condition is satisfied, the first of eqs. (103) implies that F x (a 2 ) = 0. We will 
assume in what follows that that extrema of F(X) occur at X ^ 0. Thus, a does not vanish, 
and one can redefine the field in such a way that the conditions (103) are satisfied for a — 1. 
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In expanding Universe, ghost condensate is automatically driven to the point Fx = 0. 
This follows from the field equation (100). Indeed, this equation can be viewed as the 
covariant conservation equation for the current 

J" = FxWg^dvt . 

In the cosmological setting, the field is consistently taken to depend on time only, so that 
the only non- vanishing component of this current is the density J°. Its covariant conservation 
implies that, like other densities, it decays in time, 

J°ocl, 

or 

which means that Fx(X) becomes negligibly small at late times. 

In the unitary gauge and at the quadratic level in metric perturbations h^, the action 
(99) in Minkowski background becomes 

Sf = A 4 ^ J d 4 xh 2 00 = \Ml x ml J d*xh 2 00 , (104) 

where Fxx = [d 2 F/dX 2 ](a = 1) is a constant. With the Einstein-Hilbert part of the action 
added, this gives eq. (74) with all masses equal to zero except for m . Thus, at the level of 
the two- derivative action there are no propagating degrees of freedom, cf. Section 3.2. 

The same can be seen in the Stiickelberg language by replacing hoo — > 2(M piitlq) -1 doit 
in eq. (104). The resulting action for the Stiickelberg field n, 

2 J d 4 x(d n) 2 , (105) 

does not have the gradient term and describes a mode with the dispersion relation 

to 2 = 0. (106) 

If the action of the ghost condensate model contained the contribution (99) only, its effect 
would be simply (partial) gauge fixing of General Relativity, so in the sector with the initial 
condition X = 1 the theory would describe the Einstein gravity in a particular gauge. This 
situation is specific to the ghost condensate model; we will see in the next Section that in 
the generic case modifications of gravity arise already from the first term in the derivative 
expansion of the action. 

The degeneracy of the action (105) (the absence of the spatial gradient terms) signals 
that the non-propagating Stiickelberg mode can become propagating once higher-derivative 
corrections are added. These corrections are routine in the effective low energy theories, 
but here they play a crucial role. The symmetry (81) restricts the general form of these 
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corrections. In the Goldstone language, the next order contribution to the action contains 
higher derivatives acting on the Goldstone field 0, such as <9 2 and d^d^cj), suppressed by 
powers of A. When expanded to the quadratic order in 7r, these terms modify the action 
(105) to 

2 J d*x{(d nf + ^(d 2 n) 2 + ^dfrdfn + l^vr) 2 + ...}, 

where q are numerical coefficients roughly of the oder of unity. The dispersion relation for 
the Stiickelberg mode becomes 

" = A 2 "^ + A 2 ^ P + A^ P • ( ° 7) 

The are two solutions to this equation. The first one is 

oo 2 = A 2 /c + 0(p 2 ). 

This solution is irrelevant, since it falls outside of the region of validity of the low-energy 
effective theory, the latter being w< A. The second solution represents the modification of 
the dispersion relation uj 2 = which now reads 

- 2 = F p4 + °(lT)' (108) 

This solution describes a slowly propagating mode [40] which is non-tachyonic provided that 
c 2 > 0. According to (105), this mode is not a ghost for F X x(a = 1) > 1. Hence, the theory 
is healthy at high spatial momenta. 

The mode (108) modifies the gravitational interaction (in particular, the Newtonian 
potential) at distances larger than r c = 1/rriQ. On the other hand, the time scale at which 
these modifications build up is parametrically larger, t c = A/itLq [40]. The reason is again 
that the modification of gravity only occurs in the next-to-leading order in the derivative 
expansion. When the mass mo tends to zero, the scales r c and t c tend to infinity, and 
the modifications are smoothly switched off. In this sense the van Dam-Veltman-Zakharov 
phenomenon is absent in the ghost condensate model. Note, though, that according to (104), 
the mass m is related to the UV scale as 

m 2 M 2 PL = F XX A 4 . 

Hence, the limit of vanishing mass corresponds to the limit A — > (at fixed M P i), so the 
region of validity of the low energy effective theory shrinks to zero in this limit. 

The ghost condensate model does not exhibit the Boulware-Deser instability either. In 
contrast to the example considered in Section 3.5, the only scalar field n present in the 
theory has, in flat background, the dispersion relation (107), which is a consequence of 
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the residual symmetry (102). In slightly curved background, this dispersion relation may 
acquire additional terms with small coefficients controlled by the background curvature. 
The appearance of new contributions to the dispersion relation - - for instance, a term 
proportional to p 2 with a negative coefficient - - may cause tachyonic instability at low 
spatial momenta. This is precisely what happens in some cosmological backgrounds [67], 
this instability being, however, not particularly dangerous. On the other hand, the terms 
induced by slightly curved background cannot change the sign of the leading u 2 term, so the 
propagating mode does not become a ghost. The situation, therefore, is different from the 
case of the Boulware-Deser mode of Section 2.6 where one of the scalar modes is necessarily 
a ghost in curved background. 

The ghost condensate model and its modifications have unusual properties. Some of these 
properties are potentially interesting from the viewpoint of phenomenology and cosmology, 
others serve as examples of novel phenomena that may emerge once Lorentz-invariance is 
broken. Let us briefly describe some of them. 

Due to Lorentz-violation and mixing of the slowly propagating field it with metric per- 
turbations, gravitational fields of moving sources are different from gravitational fields of 
sources that are at rest with respect to ghost condensate. In particular, there is a memory 
effect: moving bodies leave "star tracks" in ghost condensate [68, 69]. 

Ghost condensate itself may be viewed as matter with rather unusual properties. In 
particular, lumps of this matter can in principle anti-gravitate [40]. More generic is the 
property that the presence of ghost condensate in space leads to an instability of the Jeans 
type with the time scale which is parametrically large as compared to ordinary fluids of the 
same energy density [40] . The latter property is related again to the presence of the slowly 
propagating mode ir. 

Non-linear dynamics of ghost condensate is also quite rich. Evolving ghost condensate 
tends to form caustics [70], much in common with caustics in some other scalar theories [71]. 
Away from the caustics, the dynamics of ghost condensate is the same as the dynamics of fluid 
with equation of state p oc p 2 . Another possible effect is the non-perturbative instability of 
the background (101) leading to the formation of microscopic "holes" of negative energy [72]. 

Lorentz-violation makes physics of black holes considerably different from that in General 
Relativity. The least dramatic effect is the accretion of the ghost condensate onto black 
holes [73, 74]. More exotic are the possibilities that black hole systems may violate the 
second law of thermodynamics [75], signals may escape from black holes [76], and black 
holes may have hair [77]. 

Cosmologically interesting class of models is obtained by adding a potential term to the 
action, so that instead of (99) one writes 
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Then both the kinetic term F(X) and the potential term V((f>) contribute to the energy- 
momentum tensor. The field <fi keeps growing, albeit not quite according to (101). This may 
be used for constructing models of inflation with ghost condensate serving as inflaton [78] 
and models for dark energy driving the present accelerated expansion of the Universe [79, 80]. 
Interestingly, the field grows even if potential increases as increases; in that case the 
field rolls up the potential. This gives rise to phantom behavior [81, 67] in which energy 
density grows in time, and the equation of state is p = wp with w < — 1 (and generically w 
depends on time). This is one of a few examples of phantom matter without UV pathologies: 
in most other cases phantom equation of state is obtained in theories with unacceptable 
tachyons and/or ghosts in UV (see, however, Refs. [82, 83]). If phantom behavior occurs at 
inflationary stage of the cosmological evolution, the consequence is the blue-tilted spectrum of 
primordial tensor perturbations (as opposed to the red-tilted spectrum predicted by theories 
where inflaton is an ordinary scalar field). The dark energy driving the present accelerated 
expansion may also have phantom equation of state, the feature potentially detectable by 
future observations of SNela (see, e.g., Ref. [84]). Perhaps the most striking possibility 
is that phantom may give rise to bouncing cosmology: in General Relativity the relation 
p < — p implies that 

H > , 

where H is the Hubble parameter, so a transition from contracting to expanding Universe 
(from H < to H > 0) becomes possible. Indeed, solutions of this sort have been found [67, 
85] and explored [86, 87] in ghost condensate models with suitable potentials V((f>). The 
bounce in these models occurs in a controllable and self-consistent way. 



5 A minimal model of massive graviton 

An interesting theory [42, 44, 43], without obvious pathologies and with massive gravitons, 
is obtained by considering the case of the residual gauge symmetry (83). This symmetry, 
x l — > x % + C'(*)> translates into the following symmetry of the Goldstone Lagrangian, 

^-^ + 5*^0), (109) 

with three arbitrary functions S\ At the one- derivative level, there are two combinations of 
the Goldstone fields that respect this symmetry, 

X = ^9 F W, 

w ij = ^{g^d^d^-g^d^d^-g^d^d^/x) 
v i v j 

yl] _ y ____ 

X ' 
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where Y % i and V 1 are defined in (87). Hence, at this level the Goldstone action is 

s^= [ d 4 x V^gF(x,w^) (no) 



where indices i,j are contracted using 5^. 
5.1 Linearized theory 

Let us first discuss this theory at the linearized level about Minkowski background. The 
background Goldstone fields are given by (84). By field redefinitions we set a = b = 1 and 
write the background fields simply as 

0° = AH, (111) 
ft = AV . (112) 

From (88) we obtain that the energy-momentum of this configuration vanishes, provided 
that 

at X = 1, = —8 l K Switching metric perturbations on, and using the unitary gauge 
<p a = <f) a , one finds that the theory about Minkowski background is gravity with the Lorentz- 
violating mass terms (74) with the only constraint 

mi — . 

Other mass parameters are independent of each other, and are expressed through F and 
its first and second derivatives at X = 1, W lj = —5^. Hence, at the level of one- derivative 
Goldstone action neither vector nor scalar sector contains propagating modes, as we discussed 
in Section 3.2.2, while tensor gravitons (two degrees of freedom) have mass mc = m 2 . In 
this sense the model can be viewed as the minimal model of massive graviton. 

It is instructive to switch off gravity and consider the Goldstone sector of this theory, in 
Minkowski background but away from the point (113). The quadratic Lagrangian for the 
perturbations tt 01 = <p a — <f) a is obtained from (110) and has the following generic form, 

At = - \{d^f + cn°d^ + ^{d^f + ^(d^r , (114) 

where b = 2(dF/dX)((f)) while other constants contain second derivatives of F at = 0. 
One observes that the fields n 1 are non-dynamical. Their equations of motion in the vector 
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sector give n Tt = 0, where ti Ti is the transverse part, di7i Tl = 0. Hence, there is no non- 
trivial modes in the vector sector even for general linearly rising background. The equation 
of motion for the longitudinal part of n l gives 

7T l = COnst • ^-7T° . 

A 

One plugs this expression into the equation of motion for rr° and obtains 

an - 6Att = , 

where a is a combination of the constants a, c, d\ and d.2- For general linearly rising Goldstone 
background, the dispersion relation is uj 2 = const • p 2 . One can see that with suitable choice 
of parameters, this mode is neither tachyon nor ghost [42, 43]. 
At the point 

dF , N 

ex =a (116 » 

the dispersion relation is u 2 = at the level of one- derivative action. In fact, this special 
point is basically coincident with the Minkowski point (113), since we neglect gravity here 
and therefore cannot discriminate between different values of F at <fi = 0. Overall, the 
situation in the scalar and vector sectors is very similar to what one finds in the ghost 
condensate theory. 

The absence of propagating modes associated with the fields ts % (rather than ir°) is by 
no means an accident. Given the background (111), the symmetry (109) implies that the 
theory is invariant under infinitesimal transformations 

7T* -> 7T* + E l {t). 

This means that at the one derivative level, the Lagrangian does not contain time derivatives 
of the fields ir\ so these fields are not dynamical. This is of course explicit in (114). So, 
the dispersion relation p 2 = 0, characteristic of non-propagating modes, is protected in this 
model by the symmetry (109). 

The latter observation is useful for the discussion of the UV sensitivity issue in this model. 
Under the assumption that higher derivative terms respect the symmetry (109), these terms 
cannot contain <9q0* (in the reference frame where the background 0° has the form (111)), 
and in terms of perturbations ir a they are quadratic combinations of 

Once these terms are added to the Lagrangian (114), the fields n l formally become dynamical, 
but it is straightforward to see that the equation for the corresponding dispersion relation 
has the form 

p 2 [uj 2 - const • A 2 + 0(p 2 )] = 
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Hence, the would-be new propagating modes have the dispersion relation 

uj 2 = const • A 2 + 0(p 2 ) . 

Since the frequencies are of order of the UV cutoff, these modes are actually absent in the 
low energy theory. In this sense the theory is UV stable: upon switching on higher-derivative 
terms, there remains the mode with the dispersion relation p 2 = only 

At this point it is worth discussing the physical interpretation of the modes with the 
dispersion relation p 2 = 0. They can be thought of as degrees of freedom with infinite 
propagation velocity (unlike the ghost condensate mode which has zero velocity at the one- 
derivative level and acquires a small velocity due to higher-derivative terms). Physically, 
they describe sound waves propagating through the rigid coordinate frame selected in space 
by the functions 0\ The rigidity of this frame is ensured by the symmetry (83) and 5*0(3) 
symmetry of the Goldstone action that allow to move and rotate this frame only as a whole. 
Infinitely fast propagating modes do not imply the violation of causality in the absence of 
Lorentz invariance, but allow for instantaneous transfer of information. The latter leads to 
a number of unusual effects related to black hole physics [75, 77]. A detailed discussion of 
the properties of these modes in a toy QED model can be found in Refs. [88, 89]. 

The higher order terms are important also for the remaining dynamical field 7r°, if the 
background obeys (115). In that case the one-derivative dispersion relation u 2 = gets 
transformed into 

to 2 = const ■ — . (116) 
A 2 

So, the spectrum of the low energy effective theory is the same as in the ghost condensate 
case, except that tensor gravitons are massive in the model discussed here. 

The symmetry (109) protects the theory from the Boulware-Deser instability as well. In 
nearly Minkowski space-time, and for the background nearly the same as in (111), (112), 
one can choose a reference frame in which the background 4>° has precisely the form (111). 
In that frame, the above analysis goes through: the fields n 1 are non-dynamical in the low 
energy effective theory, at least for u 2 , p 2 m^, and there remains one dynamical mode 
associated with the field it . Its dispersion relation coincides with (116) modulo corrections 
proportional to the deviation of the background from Minkowski space. 

5.2 Phenomenology 

By analogy to conventional field theory one might expect that non-zero graviton mass leads 
to the exponential suppression of the gravitational potential at distances greater than the 
inverse graviton mass. The latter would then be constrained by observations to be very 
small. This is not the case in the model described by the action (110), the reason being the 
violation of Lorentz-invariance. We will see below that the gravitational potential remains 
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unchanged at the linear level, at least in some region of parameter space. In this region 
the behavior of the model is similar to General Relativity in many respects and may be 
phenomenologically acceptable. At the same time, there may exist a number of interesting 
and potentially detectable effects, the non-zero graviton mass being one of them. 

5.2.1 Newton's law 

Newton's law emerges from General Relativity in the linear approximation. In order to 
derive its analog in the model described by the action (110), it is instructive to go back 
to the unitary gauge where the perturbations of the Goldstone fields are absent, and the 
only perturbations are those of metric. This will make the comparison to General Relativity 
straightforward. As in Section 2.1, it is convenient to decompose the metric perturbations 
according to eq. (1). The quadratic part of the action is then given by 

L<V = L% + L m + L s , (117) 

(2) 

where L EH , L m and L s come from the Einstein-Hilbert, mass and source terms, respectively. 
The Einstein-Hilbert term is given by eq. (6), while the mass and source terms are 

L m = M 2 Pl {~m 2 2 (^) 2 - ImKd.Fj) 2 + mfo 2 + (m 2 3 - m\) (AE) 2 - 

-2(3m 2 3 - m\)^AE + 3 (3m^ - mfj ^ 2 + 2m\AE - 6m^j, (118) 

L s = -T o (<p + d B - d 2 E) - Taip + + d Fi)T 0i + ^% (119) 

Notations for the masses are the same as in (74); the masses m 2 are combinations of the 
first and second derivatives of the function F, the parameter A and the Planck mass. As 
discussed above, their overall scale is m ~ A 2 /M P i. The source term contains an external 
energy-momentum tensor which we assume to be conserved. All combinations coupled 
to the components of T^ v are gauge invariant. The one multiplying T 00 , 

$ = + d B - d 2 E, 

plays the role of the Newtonian potential in the non-relativistic limit of General Relativity. 

In the tensor sector, only the transverse traceless perturbations hf^ are present (two 
degrees of freedom). Their field equation is that of a massive field with the mass = m 2 . 
Note that massive tensor field does not necessarily have five polarizations in a Lorentz- 
breaking theory. The examples of this phenomenon have already been discussed in the 
preceeding Sections. 

In the vector sector the field equations read 

-A(S i + d F i ) = -T 0i , (120) 
d A(Si + doFi) + mjAFi = d T 0i . (121) 
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Taking the time derivative of Eq. (120) and adding it to Eq. (121) gives 

F t = 0, 

provided that m\ 7^ 0. Thus, the vector sector of the model behaves in the same way as 
in the Einstein theory in the gauge Fi = 0. There are no propagating vector perturbations 
and interaction of sources is not modified in the vector sector unless one takes into account 
non-linear effects or higher derivative terms. 

The interaction potential between static sources ( Newton's potential) is determined by 
the scalar sector of the model. The field equations for scalar perturbations are 

2A^> + mfo + m\AE - 3m& = (122) 

2A$ - 2A^ + 6dfy - (3m 2 3 - mfj AE + 3 (3m 2 3 - m 2 ) tp - 3m\<p = (123) 

2M pl 

-2Ad 2 ^ + (ml - ml) A 2 E - (Smj - ml) Aip + m\A V = (124) 

= (125) 



2M 2 PI 



Eq. (125) implies 



1 T, 



+ = imr + +°W> (126) 

where ipo{% 1 ) is an arbitrary time-independent function. From Eqs. (122) and (124) one finds 

2m\m\ , 2{m 2 .-ml) . , . „. 

v =^r^ +± ^r^ A ^ (127 > 

2m\m\ \ 2m\ 



AE ={ 3 -^r)*- -w^ (128 > 

where 

971 = m\ — ml(ml — m\). 

Finally, substituting Eqs. (126), (127) and (128) into eq. (123) one finds the gauge-invariant 
potential $, 

(129) 



A AM 2 pl A 2 AM Pl \ 971 J A \AAM 2 pl ru ) \ 971 



The first two terms in the right hand side of eq. (129) are the standard contributions in the 
Einstein theory, the first one becoming the Newtonian potential in the nonrelativistic limit. 
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Thus, except for the -^-dependent terms, the gauge-invariant potentials $ and ip differ from 
their analogs in the Einstein theory and ipE by the mass-dependent third term in the 
right hand side of eq. (129), 



i> = *I>e, 

$ = + ( 3 - 




raj Tqq 
A 2 AM 2 ' 



(130) 



The second term in eq.(130) vanishes if all masses uniformly tend to zero, so both of the 
potentials ip and $ become the same as in General Relativity in the massless limit. This 
means the absence of the vDVZ discontinuity in the model. 

For a static source, Eq. (130) leads to the modification of the Newtonian potential of a 
point mass M which in coordinate space takes the form 



Since the potential is growing, the perturbation theory breaks down at distances r > 
l/(G ? ArM/i 2 ). This would be unacceptable for relatively large graviton masses. However, 
the modification of the potential is absent in the case 3971 = 2rn^m\ (and 9Tt 7^ 0). We will 
see in what follows that this condition can be ensured by a particular dilatation symme- 
try 19 which is automatically enforced at the cosmological attractor, i.e., at late times of the 
cosmological evolution. 

The freedom of choosing the time-independent function i])q{x) which enters the above 
gravitational potentials corresponds to the presence of the scalar mode with the dispersion 
relation oj 2 = 0. As discussed in Section 5.1, this mode is an analog of the ghost condensate 
mode and becomes dynamical with the account of higher-derivative terms in the action, 
acquiring the dispersion relation oo 2 oc p 4 . The value of ipo is determined by the initial 
conditions. In the linear regime, a non-zero value of ipo would mean the presence of the 
incoming "ghost condensate wave". So, for the purpose of finding the potential between 
sources, the physical choice is ipo(x l ) = 0. We note, however, that this choice is not so 
evident in the cosmological context. 

19 Like other symmetries which we have discussed in Sect. 3.3, this dilatation symmetry, eq. (138), may be 
viewed as an unbroken part of the diffcomorphism invariance. 




where 




(131) 
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5.2.2 Cosmological solutions 

At the time of writing this review, only spatially flat cosmological solutions are known in 
the model (110). The flat cosmological ansatz is 20 

ds 2 = dt 2 — a 2 (t)dx 2 , 

0° = (f)(t), l = AV. (132) 

For this ansatz W l i = —a~ 2 S 1 ^, so the function F in eq. (110) depends only on X and a, 
F = F(X, a). The Einstein equations reduce to the Friedman equation, 

G) 2 = ikM- + 2A ' XFx - A ' F } s + pi + P2 1 ■ (133) 

where p m is the energy density not including Goldstone fields. The field equation for 0° is 

d t (aPVXFx) = . (134) 

The field equations for <p l are satisfied automatically. In principle, it is straightforward to 
solve this system of equations for any given function F(X,a). Upon integration, eq. (134) 
gives an algebraic equation which determines X as a function of the scale factor a. This 
makes eq. (133) a closed equation for the scale factor a(t). 

From the point of view of cosmological applications, of particular interest are solutions 
where the scale factor a(t) tends to infinity at late times. Since the graviton masses are 
linear combinations of the function F(X, a) and its derivatives, one may wonder whether 
they remain finite or tend to zero in this limit, and whether the effective-theory description 
remains valid. Indeed, eq. (134) implies that at late times either X or F x tend to zero, 
which suggests that the graviton masses might tend to zero as well. 

Consider a particular class of functions F such that X(a) as found from eq. (134) asymp- 
totes to some power of a at large a. This is not a very restrictive assumption — for instance, 
it is satisfied by any algebraic function F(X,a). Then there exists a real constant 7 such 
that the combination X 1 /a 2 tends to a non-zero value as a — > 00. Eq. (134) implies that 
XFx = const • \fX/a?] this determines the dependence of the energy component p 1 on the 
scale factor, 

1 . . 

pi = const 3 _ 1/7 - (135) 



20 In principle, one may write a more general time-dependent ansatz for the scalar fields, namely, <f)' 1 = 
A 2 C(t)x z , where C(t) is an arbitrary function of time. For models respecting the symmetry (138), which we 
mainly consider in what follows, this ansatz leads to the same cosmological evolution as the ansatz (132), 
see Ref. [43] for details. 
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This relation generalizes the behavior found in the ghost condensate model where the energy 
density of the ghost condensate scales like 1/a 3 [40] (the latter behavior is recovered from 
eq. (135) at 7 — > 00). 

For 7 > 1/3 the energy density p\ behaves like the dark energy component with the the 
negative pressure. Its equation of state varies between that of cold dark matter, w = (for 
7 = +00), and that of the cosmological constant, w — — 1 (for 7 = 1/3). For < 7 < 1/3 
the term p\ grows with a. This corresponds to the energy density component with a highly 
negative equation of state, w < —1. Without fine tuning, this contribution cannot be 
canceled out by the term p 2 , so that the Hubble rate diverges as a — > 00 leading to the 
breakdown of the low-energy effective theory and rapid instabilities [90] . In what follows we 
assume that 7 does not belong to this range. For 7 < the energy density p\ corresponds 
to fluid with positive pressure. 

In order to see that the graviton masses remain finite and the effective field theory 
description is valid in the limit a — > 00, it is convenient to replace X by a new variable Z = 
X^/a 2 . The function F(X,a) becomes the function of Z and a, F{Z,a) = F(Z 1 ^a 2 ^,a). 
Note that it satisfies the relation ^ZF Z = XF X , where F z = dF/dZ. In these notations 
Eq. (134) reads 

-fa^^Z^^FziZ, a) = A, (136) 

where A is an integration constant. This equation determines Z as a function of a. By 
construction, this dependence is such that Zia — > 00) = Z , where Z is some constant. 

If one assumes further that the function F(Z, a) is regular at a — > 00, then at late times 
one has 

F(X,a) = F(Z,a)^F (Z). (137) 

In terms of the original variables this means that in the limit a — > 00 the function F(X, W^) 
depends only on the combination X 1 W l K This corresponds to the following dilatation 
symmetry of the Goldstone action, 

00 -> A0 O , 

<t>i -> A _7 0j, (138) 

which is equivalent, in the unitary gauge, to the following unbroken part of diffeomorphism 
invariance, t — > At, x l — > \~~'x\ In this case one has 

p 2 = -A 4 F (Z ), 

which behaves like a cosmological constant (assuming F (Z ) 7^ 0). Likewise, at a — > 00 the 
graviton masses become functions of Z and in general remain finite. 

The models obeying eq. (137) have an interesting feature which is a consequence of the 
symmetry (138). It is straightforward to check that Eq. (138) implies the following relations 
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among graviton masses in Minkowski space, 



ml = — 37TO4, 7(^2 — 3mg) = m\. (139) 

These relations ensure that the parameter /i 2 defined by Eq. (131) is zero, i.e., the correction 
to the Newtonian potential (the last term in eq. (130)) vanishes. Thus, apart from the effects 
of the higher derivative terms, at late times the only modification of gravity at the linear 
level is the non-zero mass of the two polarizations of the graviton. 

A particularly simple case occurs when the function F depends only on the combination 

z ij = xm ij . (140) 

If 7 > 1/3 or 7 < 0, the evolution drives the system to the point F z = 0, in full similarity 
with the ghost condensate model. In the case < 7 < 1/3 and regular F, Z tj diverges at 
large a. This breaks the validity of the low energy effective theory. 

5.2.3 Massive gravitons 

Let us consider in more detail the properties of massive gravitons, namely, the experimental 
constraints on the graviton mass and the possibility of the graviton creation in the early 
Universe. For simplicity, in this Section we limit ourselves to the model with the action 




(141) 



where Z tj are given by eq. (140). In this model there are no corrections to the Newton's 
potential at the linear level, so the tests of (linearized) gravity based on the Solar system and 
Cavendish-type experiments [91] are automatically satisfied. The constraints on the graviton 
mass come from the emission and/or propagation of gravitational waves. 

Observations of the slowdown of the orbital motion in binary pulsar systems [92] are 
considered as an indirect proof of the existence of gravitational waves. The agreement of 
these observations with General Relativity implies that the mass of the graviton cannot be 
larger than the characteristic frequency of the emitted gravitational waves. This frequency 
is set by the period of the orbital motion which is of order 10 hours, implying the following 
limit on the graviton mass, 

^ = v G < 3 • 10~ 5 Hz « (10 15 cm)' 1 ~ (70AU)- 1 . (142) 

Thus, the maximum allowed graviton mass is comparable to the inverse size of the Solar 
system, which is a very large mass (short distance) in cosmological standards. Gravitons 
of such a mass can serve as dark matter candidates provided they can be produced in 
in the early Universe in sufficient numbers. Indeed, if the graviton mass is large enough, 
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(mt;)" 1 < 1 kpc ~ 3 ■ 10 21 cm, where v ~ 10~ 3 is typical velocity in the halo, massive 
gravitons may cluster in galaxies and account for the dark matter in galactic halos. 

It is straightforward to estimate the cosmological abundance of relic massive gravitons. 
The massive gravitons are described by the transverse traceless perturbation of the metric, 
hf?. The quadratic action for (we omit the superscript in what follows) in the expanding 
Universe has the following form, 

M 2 Pl J d 3 kd V a 2 ( V ) [{h' l3 f - {d^f - mla 2 {rf)hl] , (143) 

where 77 is conformal time and prime denotes d/dr]. Eq. (143) has the form of the action 
of a minimally coupled massive scalar field. Therefore, similarly to scalar bosons, massive 
gravitons are produced efficiently during inflation (cf. Ref. [93]). 

To be concrete, consider a scenario where the Hubble parameter Hi is constant during 
inflation. This scenario may be realized, for instance, in hybrid models of inflation [94]. First, 
one has to check that the phenomenologically relevant values of parameters correspond to 
the regime below the cutoff scale of the effective theory, i.e. Hi < A. This implies for the 
energy scale of inflation ~ \JHiM Pi that 

Ei < rn^M^Ji « 10 7 GeV (m G ■ 10 15 cm) 1/4 . (144) 

This value is high enough to be consistent with everything else in cosmology (in particular, 
to allow for successful baryogenesis) even for graviton masses of the order of the current 
Hubble scale. 

Consider now the production of massive gravitons. Assuming the above scenario of 
inflation, the perturbation spectrum for the massive gravitons is that for the minimally 
coupled massive scalar field in de Sitter space [95] , 

Importantly, for long enough inflation, the present physical momenta of most of the gravitons 
are smaller than the present Hubble scale. 

Metric fluctuations remain frozen until the Hubble parameter becomes smaller than the 
graviton mass, and afterwards they start to oscillate with the amplitude decreasing as a -3 / 2 . 
The energy density in massive gravitons at the beginning of oscillations is of order 

p. ~ M 2 Pl m 2 G (h%) , 

where we neglected a pre-factor which is roughly of order 1. Today the fraction of the energy 
density in the massive gravitons is 

KPc ztPc \HJ 
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where z* is the redshift at the start of oscillations, ~ mg is the Hubble parameter at 
that time, H e ps 0.4 • 10~ 12 s _1 is the Hubble parameter at the matter/radiation equality, 
and z e 3200 is the corresponding redshift. Combining all factors together we get 



This estimate assumes that the number of e-foldings during inflation is large, In N e > H 2 /m 2 G , 
which is quite natural in the model of inflation considered here. 

According to eq. (147), massive gravitons are produced efficiently enough to comprise 
all of the cold dark matter, provided the value of the Hubble parameter during inflation is 
about one order of magnitude below the scale A. Interestingly, one obtains Q g ~ 1 when the 
initial energy density in the metric perturbations is close to the cutoff scale, p*^ 4 ~ A. This 
suggests that other mechanisms of production unrelated to inflation may naturally lead to 
the same result, Q 9 ~ 1. 

If massive gravitons have been produced in substantial amounts during the evolution 
of the Universe, they can be observed by the gravitational wave detectors. At distances 
shorter than the wavelength, the effect of a transverse traceless gravitational wave on test 
massive particles in Newtonian approximation is described by the acceleration h^x^ /2 (see, 
e.g., Ref. [96] for a review). The same is true for massive gravitational waves, the only 
difference being that the wavelengths are longer in the non-relativistic case, so the Newtonian 
description works for larger range of distances. Thus, the non-relativistic waves act on the 
detector in the same way as massless waves of the same frequency. 

To estimate the amplitude of the gravitational waves we assume that they comprise all of 
the dark matter in the halo of our Galaxy. The energy density in non-relativistic gravitational 
waves is of order M^mg/i- -. Equating this to the local halo density p ~ 0.3GeV/cm 3 one 



At frequencies 10~ 6 -j- 10~ 5 Hz this value is many orders of magnitude above the expected 
sensitivity of the LISA detector [97]. Thus, LISA may observe massive gravitational waves 
even if their abundance is much lower than that required to play the role of the dark matter. 
Note that in the nearby frequency range 10~ 9 10~ 7 Hz there is a restrictive bound [98] at 
the level £l g < 10 -9 on the stochastic background of gravitational waves coming from timing 
of millisecond pulsars [99]. So, it is possible that the model can be tested by the re-analysis of 
the already existing data on pulsar timing. This re-analysis would have to take into account 
that, unlike the usual gravitational waves, relic massive gravitons produce a monochromatic 
line at the frequency equal to the graviton mass. Such a narrow line with the relative width 
Au/u ~ 10~ 3 is a distinctive signature of the model. 




(147) 



gets 




(148) 
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Another possible signature is the time delay of a gravitational wave signal as compared 
to electromagnetic radiation. In terms of the frequency of the wave / and the distance D to 
the source, the time delay is given by 

D ( ran 

At 



2 V2vr/ 

(assuming / ^> m G ). Consider as an example gravitational waves emitted during merger of 
two massive black holes — one of the promising processes from the standpoint of gravitational 
wave detection. The frequency of these waves is of order of the gravitational radius of the 
resulting black hole, 

Mf 
2M' 

where M is the black hole mass. Thus, for m G ~ 10~ 15 cm _1 the time delay is 



/ ~ R s = 7777: 



D ( Mine \ a D \ f M\ 

At ~ 77 —T^~ ~ 5 X 10~ 6 — - — 

2 \nM*J \MpcJ \M & 



2 

S. 



This is probably too small to be detected for solar mass black holes, but may be detectable 
for heavier ones. 



5.2.4 Refined cosmological tests: growth of perturbation 

Given that some models of massive gravity pass the most obvious experimental tests, the 
question arises whether they may provide a viable alternative to General Relativity in de- 
scribing more subtle issues. One of these issues is the theory of structure formation. In the 
standard cosmology based on General Relativity, the formation of the observed structure 
in the Universe is explained by the growth of primordial perturbations, mostly during the 
matter-dominated stage (see,, e.g., [100, 101] and references therein). The conventional the- 
ory is in good agreement with observations provided the dark matter component has the right 
properties [102, 8, 103]. It is not obvious that General Relativity can be modified without 
spoiling this agreement. We demonstrate in this Section that massive gravity model de- 
scribed by the action (141) is an example of such a modification, i.e., this model successfully 
passes structure formation test even though the graviton mass is very large in cosmological 
standards. This again illustrates the fact that in Lorentz-violating theory, the mass of trans- 
verse traceless graviton has very little to do with the properties of 3-dimensionally scalar 
modes. 

Perturbations relevant for structure formation are 3-dimensional scalars. In massive 
gravity the scalar sector contains additional scalar fields which may alter the growth rate 
and make the model incompatible with observations. Without gauge fixing, the scalar sector 
contains metric perturbations (p (not to be confused with the Goldstone fields 0°, 0*), B, ip 
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and E denned according to eqs. (55) and (5), perturbations of the Goldstone fields ttq and 
til (the longitudinal part of 7Tj), and perturbations of ordinary matter. In total there are 9 
scalar perturbations of which one can form 7 gauge-invariant combinations whose dynamics 
is responsible for the structure formation. The complete set of equations that govern the 
behavior of these perturbations can be found in Ref.[104]. 

The system of equations for perturbations can be reduced to two equations for the gauge- 
invariant gravitational potentials $ and m. In General Relativity they satisfy the relation 
$ — ^ = 0. In massive gravity this relation changes to 



$ _ * = (^) 



,1/7-1 



(149) 



where d{x' 1 ) is an arbitrary function of spatial coordinates which arises as an integration 
constant. The origin of this constant is the presence of the mode with the dispersion relation 
oo 2 = 0. We have already encountered the appearance of such a constant in Sect. 5.2.1. 
The second equation is the closed equation for 



d 2 ^ 1 / 9 H'\ 8^ 1 

9 1 H' 
3 ^ + ^ + V 



H 2 



(150) 



In terms of the solutions to this equation, the density contrast is expressed as follows, 

d 



^ ( 7 A - 3H 2 ) a^-H - ^ 

Pm a Pm 



3H J 



da 



-A 



(151) 



where d is the same time-independent function of the spatial coordinates as in eq. (149). 

The standard cosmological perturbations are recovered by setting the graviton masses 
to zero, m 2 = 0. In this case one has $ — ^ = 0, i.e., $(x l ) = 0. Then the equations for 
perturbations become identical to those in the Einstein theory. Note that the function t? is 
determined by the initial conditions. Setting d = would eliminate the ^-dependent terms in 
eqs. (150) and (151) and bring these equations to the conventional form even in the case tjiq ^ 
0. Hence, there always exist initial conditions such that the model (141) exhibits the standard 
rate of perturbation growth and, therefore, is compatible with observations. Furthermore, 
at some values of the parameter 7 the part of the perturbations that is proportional to 
grows slower than the conventional part and hence it is subdominant, so the agreement with 
observations is achieved for any function $(x l ) provided it is not too large. 

In the case of matter perturbations in matter-dominated Universe eq. (150) reduces to 
the following equation, 



d 2 m 7 dm 

da 2 2a da 



7 



- 1 J a 1 ^- 3 ^ = 0, 
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which differs from the standard case by the presence of the inhomogeneous term proportional 
to The solution to this equation reads 

* = --^—a^-^x 1 ) + a^ 2 Cl (x l ) + c 2 (x*), 
I + 07 

where Ci(x l ) are the integration constants. Substituting this solution into eq. (151) one finds 
the density contrast 

K __ (^ A + 3 )^) +2 (^ -:).(,) 

+^- i (^- i h ,) - <i52) 

where po is the energy density of matter at present. The first two terms in this equation are 
precisely the ones which appear in the standard Einstein theory, the second term describing 
the linear growth of the perturbations, 5 P oc a. The difference with the conventional case is 
in the third term on the right hand side of eq. (152). The perturbations corresponding to 
this term grow proportionally to a 1 / 7 . For 7 > 1 or 7 < these "anomalous" perturbations 
grow slower than the standard ones. 

At the epoch of radiation domination the situation is similar. For relativistic fluid one 
has c 2 = w = 1/3, so that eq. (150) becomes 

a>* + « _ «|a, + (— — 1 — a v,-3„ = , ( 153 ) 



da 2 a da p r \7 p. 

where p r is the energy density of radiation at present. For generic value of 7 the solution to 
this equation is cumbersome. For simplicity let us concentrate on the modes which are shorter 
than the horizon size, k 2 3> H 2 . The density contrast calculated according to eq. (151) has 
the standard oscillating piece and the extra part proportional to 



5 P ~ ci(x l ) siny + c 2 (x L ) cosy + 27 



. , (l/7-l)/2 
Pr 



k 2 M 2 pl 



x 



ry 

+ / c\xx 1+lh sm(y-x) (154) 
Jo 



where y = rikj\f% is proportional to the scale factor, while Ci(x l ) are two integration con- 
stants. As one can see from this expression, for — 1 < 7 < the -^-dependent contribution to 
the density contrast decays in time, so that only the standard contribution remains. Thus, 
in this range of 7 the perturbations behave just as predicted by General Relativity in both 
matter and radiation-dominated epochs. 

Another case of interest is 7 = 1. This case is special because at 7 = 1 the a-dependence of 
the last term in eq. (153) disappears. In fact, one can show that in this case the dependence 
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on -d cancels out in the density contrast, so that only the standard part of perturbations 
remains. 

At other values of 7, the ^-dependent contributions to perturbations grow in the radia- 
tion dominated Universe. Whether the model of this sort is compatible with observations 
depends on the unknown function ("integration constant") ^(x 1 ). It is worth pointing out 
that this function may become slowly varying in time when higher-derivative corrections 
to the action (141) are taken into account. It remains to be understood whether these 
corrections can drive q 9{x t ) to zero during inflation, in which case the dependence on the 
initial value of d{x l ) will be eliminated and the model will be compatible with observations 
at any value of the parameter 7. 

5.2.5 Non-linear solutions: black holes 

The approach based on the Goldstone fields with the action (86) (as compared to, say, the 
Fierz-Pauli model) is fully non-linear. We have already taken advantage of this fact in Sec- 
tion 5.2.2 where we derived cosmological solutions in massive gravity. Another interesting 
question related to non-linear gravitational dynamics is the existence and properties of black 
holes. Rapid progress in observational techniques will allow for quantitative study of astro- 
physical black holes in near future, including mapping the metric near the black hole horizon 
[105, 106, 107, 108, 109, 110]. It is therefore important to understand what kind of deviations 
from General Relativity are possible at least in principle. 

Black hole properties are universal in General Relativity, in the sense that the black 
hole metric is uniquely characterized by the black hole mass and angular momentum. This 
property is related to the causal structure of the black hole space-time and is a consequence 
of the "no-hair" theorems [111, 112, 113, 114]. Thus, properties of black holes are extremely 
"resistant" to modifications. As an example, they remain unchanged in scalar-tensor theories 
[111, 112, 115, 116]. For this reason, constructing an alternative model of a black hole is a 
challenging problem. 

We will see in this Section that the black hole properties in massive gravity do differ 
from those in General Relativity. In other words, in massive gravity black holes do have 
"hair". The origin of these hair lies in the instantaneous interaction present in Lorentz- 
violating massive gravity. Their existence is thus related to the mode with the dispersion 
relation p 2 = 0, which is in turn a consequence of the symmetry (83) as has been pointed 
out in Section 5.1. For simplicity, we again limit our discussion to the particular class of 
models with the action (141). Since the presence of the instantaneous interaction is a generic 
property of Lorentz- violating massive gravities (in particular, models with the action (110)), 
we expect our conclusions to apply to a wider class of models than considered in this Section. 

The most straightforward approach to the problem would be to try to find the black hole 
solutions explicitly and to see if they differ from General Relativity black holes. However, 



65 



this appears to be a prohibitively difficult task. One may simplify the problem by asking a 
slightly different question: does massive gravity possess a black hole solution with exactly 
the same metric as in General Relativity? To answer this question one would have to find the 
configuration of the Goldstone fields such that for the given black hole metric all equations of 
motion (the Einstein equations and the equations of motion of the scalar fields) are satisfied. 
If this is possible, then the solution with the given metric exists. Alternatively, if this is not 
possible, the black hole solutions are modified in massive gravity. 

In order that the black hole metric be a solution to the Einstein equations, the energy- 
momentum tensor of the Goldstone fields 0° and (jf must vanish in the exterior of the black 
hole, 

SF f / W ij V l V^\ 

+ XWrfdrP - y (dA°9*<P - dA'd^) } ■ (155) 

where g^ v is the black hole metric. It is clear from eq. (155) that, except perhaps for some 
very special functions F, the energy-momentum tensor does not vanish as it would require 
that 10 equations are satisfied with 4 unknowns. 

Recall now that our model is constructed in such a way that the energy-momentum 
tensor of the Goldstone fields vanishes in Minkowski space. This is achieved by choosing the 
vacuum solutions for the Goldstone fields, eqs. (Ill) and (112), in such a way that eqs. (113) 
are satisfied. For the model (141) the latter equations imply 

5F 

F = °- im = 

in the Minkowski vacuum where 

W ij = -5 ij . (156) 

Thus, we can make vanish if we find a configuration of the Goldstone fields such that 
eqs. (156) are satisfied in the background metric of the black hole. 

There are fewer equations in (156) as compared to (155), but they are still too many - 
the system (156) contains 6 differential equations for only 4 unknown functions 0° and 0\ 
Consequently, if there are no degeneracies, these equations are impossible to satisfy and 
we expect that the Goldstone fields cannot be adjusted in such a way that their energy- 
momentum tensor is zero. 

An equivalent form of eq. (156) can be obtained by going into the unitary gauge. In this 
gauge eq. (156) becomes 

(<7°°) V = (157) 
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In geometrical terms, solving eq. (157) is equivalent to finding, for a given metric, the 
coordinate frame in which the constant time slices are conformally flat. This reformulation 
of eq. (156) is particularly convenient. 

In the case of the Schwarzschild black hole there actually exists a solution to eqs. (156). 
Equivalently, there exists a coordinate frame in which the spatial part of metric is conformally 
flat, the so-called Gullstrand-Painleve frame. In this frame the black hole metric has the 
form 



ds 2 = dr 2 



dx % 



BP 



x l dr 



where R s is the Schwarzschild radius of the black hole and r ■ 
configuration that solves eqs. (156) is simply 

0° = A 2 r, l = AV. 

Transforming back to the Schwarzschild coordinates one finds 



'x 2 , while the scalar field 



(158) 



0° = A 2 



t + 2^R~ s + R s In (^r 



R f 



R s 



while (j) 1 are still given by eqs. (158). Thus, the Schwarzschild black holes are solutions of 
massive gravity as well. 

The situation is different in the case of a rotating black hole: the above miracle does not 
happen and, as expected, eqs. (156) or (157) do not have solutions. In fact, conformally flat 
spatial slicings are an important ingredient in the numerical simulations of the black hole 
mergers, so their existence for various solutions of the Einstein equations has been extensively 
studied [117, 118]. In particular, it was proven that the conformally flat slicing of the Kerr 
metric is impossible due to the existence of the non-trivial invariant of the quadrupole origin 
[118] 

T = -112ttJ 2 . (159) 

Moreover, the results of Ref. [118] imply that not only the Kerr metric, but an arbitrary 
axisymmetric vacuum solution of the Einstein equations with non-zero angular momentum 
has a non-vanishing value of T and, consequently, does not allow conformally flat spatial 
slicings. Therefore, there are no configurations of the Goldstone fields such that their energy- 
momentum tensor is zero in the background of the Kerr or any other metric with non- 
zero angular momentum. Consequently, rotating black holes in massive gravity have to be 
different from the Einstein theory. 

The fact that the rotating black holes are modified in the presence of the Goldstone fields 
as compared to their General Relativity counterparts is in accord with the expectation that 
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in massive gravity black holes may have "hair" . The existence of hair can be demonstrated 
explicitly in a toy model of Lorentz-violating electrodynamics with the action [77] 

S = S EH + j d 4 xvq |f(X) - X -Fl v + m 2 G^A, j . (160) 

Here Seh is the Einstein-Hilbert action, X is given by eq. (87) while is the "effective 
metric" 

d^6°d u (f) 

Qiiv _ „liv _ Y y 

y X 

This model is analogous to massive gravity in that it possesses the instantaneous inter- 
actions [88] which are responsible for the presence of black hole hair. Moreover, one can 
show [77] that the standard charged rotating black holes are not solutions in this model, in 
full similarity with rotating black holes not being solutions in massive gravity. 

To demonstrate the existence of electromagnetic hair in the model (160) one has to show 
that there exist non-trivial static finite-energy solutions for the linearized perturbations of the 
electromagnetic field in the background of the Schwarzschild black hole. In the sector with 
the angular momentum 1=1, the vector field can be parameterized by 4 real functions of the 
radial variable p (see Ref. [77] for explicit expressions). The equation for the perturbations 
of the vector field translates into a coupled system of ordinary differential equations for the 
radial functions. One has to show that there exists a solution to this system that is regular 
both at infinity and at the black hole horizon p — > — oo. 

The existence of a regular solution can be demonstrated by counting of decreasing and 
growing modes in the asymptotic regions. Here we only present the results; details can be 
found in Ref. [77]. One can show that one of the four radial functions decouples and the 
corresponding equation does not have regular solutions. The equations for the remaining 
three radial functions can be rewritten in terms of a single fourth-order equation. Thus, 
an arbitrary solution is parameterized by four real parameters, one of which is an overall 
normalization. At infinity, there are two decreasing and two growing solutions. Requiring the 
general solution to decrease at infinity fixes two of these three parameters. At the horizon one 
finds three regular and one singular solutions. The remaining free parameter can, therefore, 
be used to eliminate the singular part and obtain the solution regular everywhere — the 
dipole hair. 

Overall, the following picture emerges. Black holes in massive gravity have no reason to 
be universal. In particular, the metric of a rotating black hole can (and must, according to 
the direct analysis) be different from that in the Einstein theory. The differences between 
different possible metrics - black hole hair - depend on the collapse history. It has been 
argued [77] that these differences, as well as the deviations from the standard metric, are of 
order one only at distances much larger than the inverse graviton mass m' 1 , and are likely 
to be suppressed by the factor ~ (ml) 2 at distances / <C m _1 , unless the parameters of the 
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model are tuned. Given the existing constraint on the graviton mass (142), in the simplest 
models the effects of the black hole hair are observable only for the largest black holes with 
masses of (a few) x 10 9 M Q . 

6 Conclusions 

To summarize, Lorentz-invariant massive gravity in 4 dimensions has severe self-consistency 
problems. It has either ghosts in the perturbation spectrum about Minkowski space, or un- 
acceptably low UV energy scale at which strong coupling sets in, plus Boulware-Deser ghost 
mode away from Minkowski background. Because of Lorentz-invariance, the pathological 
ghost modes exist at arbitrarily high spatial momenta, so vacuum in this theory is catas- 
trophically unstable. Presently, no way of curing these problems is known, and it appears 
rather unlikely that this theory can be made healthy and phenomenologically acceptable. 

Infrared modified gravities may be less problematic in theories with extra spatial dimen- 
sions and brane- worlds. Among the most widely discussed models of this sort is the DGP 
model whose normal (as opposed to self-accelerated) branch does not have ghosts in the 
spectrum and may or may not have acceptably high UV strong coupling scale. 

In this review we followed another route and discussed Lorentz- violating theories. Among 
those, we concentrated on a subclass of theories which, in the unitary gauge, have only metric 
as dynamical field, and which have Minkowski space as a solution to the field equations. 
Under these conditions, the lowest order terms in the action in Minkowski background are 
mass terms for metric perturbations. Hence, the emphasis in this review was on Lorentz- 
violating massive gravities. There is a plethora of other possibilities, some of which are 
reviewed, e.g., in Refs. [119, 38]. 

Once the spectrum of a theory is not Lorentz-invariant, ghosts, and to lesser extent 
tachyons become phenomenologically acceptable, provided they exist only at sufficiently low 
spatial momenta and energies and only weakly (e.g., gravitationally) interact with matter. 
Furthermore, some Lorentz-violating massive gravities do not have obvious pathologies at 
all, and are phenomenologically acceptable even for relatively high energy scale of Lorentz- 
violation. Unlike the Fierz-Pauli theory that has vDVZ discontinuity, at the classical level 
these theories are smooth, perturbative deformations of General Relativity, while at the 
quantum level their UV strong coupling scale is not dangerously low. The most appealing 
among these theories are the ones that leave unbroken some part of the diffeomorphism 
invariance of General Relativity, the feature that ensures the stability of these theories against 
deformation of the background and/or generation of higher order terms in the action. 

A general problem we should mention in this regard is the UV completion of these 
theories. Unlike General Relativity which is believed to be an effective low energy theory de- 
scending from string theory, massive gravities do not have obvious string theory completions. 
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We believe this issue is worth investigating in the future. 

Massive gravities of the sort we discuss in this review are conveniently analyzed by mak- 
ing use of the Stiickelberg-Goldstone formalism. This formalism involves scalar fields whose 
background values roll along either time-like or space-like directions, or both. The advan- 
tage is that the full general covariance is restored, so at energies and momenta exceeding 
the graviton mass scale the new modes, over and beyond gravitons of General Relativity, 
are perturbations of these scalar fields, which effectively decouple from gravity (except for 
the Fierz-Pauli case). In this way the spectrum of the theory is studied rather straightfor- 
wardly. Furthermore, one may view the Goldstone action as a non-linear generalization of 
the graviton mass terms, and proceed to study non-linear properties of the resulting theory, 
such as cosmology and black holes. 

Rolling scalar fields are interesting in many respects, even though their perturbations 
may be gauged away so that in the unitary gauge the theory involves metric only. In the 
cosmological context, rolling scalar fields are capable of giving rise to the late time accelerated 
expansion of the Universe, with non-trivial equation of state of the effective dark energy. It 
is worth noting that other theories with IR modified gravity are often unable to do that. As 
an example, in theories with vector field condensates, the latter may tend to constant values 
at late times. Then there is a general argument showing that the late time evolution of 
the Universe is basically the same as in General Relativity 21 , possibly with the cosmological 
constant [121, 122]. The argument goes as follows. Spatially flat, homogeneous and isotropic 
metric has the general form 

ds 2 = N 2 (t)dt 2 - a 2 (t)5 ij dx i dx j . 

This form is symmetric under time reparameterizations and space dilations, 

t -> t'{t) , x i -> Xx { (161) 

with arbitrary function t'(t) and arbitrary constant A. With matter fields settled down at 
their vacuum values (in locally Minkowski frame), the only dynamical variables are N(t) 
and a(t), and the action for these variables should respect the symmetries (161). The only 
action that is local in time, consistent with these symmetries and has no more than two time 
derivatives is 



S(N, a) = Mp t j (ft! Q - A j Ndt 



where M Pt and A need not coincide with the genuine Planck mass and cosmological con- 
stant. This action has precisely the same form as the action of General Relativity with 



21 The condensates may still evolve at the present cosmological epoch, leading to dark energy with non- 
trivial equation of state. This possibility has been explored, e.g., in Refs. [82, 120, 83]. 
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the cosmological constant, specified to homogeneous and isotropic space. No matter what 
condensates are there in the Universe, its evolution proceeds according to the Friedman 
equation, possibly with modified Newton's and cosmological constants 22 , provided that the 
condensates are independent of space-time point (in locally Minkowski frame) and consistent 
with homogeneity and isotropy of space. 

Despite apparent generality, this argument does not apply to the rolling scalar fields just 
because their background values do depend on space-time point. We gave explicit example 
of non-trivial late-time cosmological evolution in Section 5.2.2. More possibilities emerge if 
one adds a scalar potential for the rolling field(s), as we discussed in the end of Section 4. 

Lorentz- violating massive gravities have a number of other interesting features. Massive 
gravitons are candidates for dark matter particles; in that case the dark matter detection is 
a job for future (and maybe even present) gravitational wave searches. Unlike in Lorentz- 
invariant theories, black holes are expected to have reach properties. On phenomenological 
side, this opens up a possibility to search for Lorentz-violation by measuring the metrics of 
black holes in the vicinity of their horizons. From theoretical viewpoint, Lorentz- violating 
massive gravities may be employed to gain better insight into both classical and quantum 
aspects of black hole physics. The studies of these fascinating issues have started only 
recently, and one expects rapid progress in this direction. 

The authors are indebted to M. Bebronne, S. Dubovsky, S. Sibiryakov and M. Libanov 
for helpful discussions and reading of the manuscript. The work of V.R is supported in part 
by Russian Foundation for Basic Research, grant 08-02-00473. The work of P.T. is supported 
in part by the Belspo:IAP-VI/ll and IISN grants. 

A Appendix 

In this Appendix we give details of the treatment of small perturbations about cosmological 
background in Lorentz-invariant massive gravity. We use the setup and notations of Section 
2.6.1. To give explicit examples, we use the theory with the particular form of the mass 
term, which is given by (63). With the cosmological constant term included, the background 
equations in conformal time are, in general 

7Y 2 = HqO 2 + eo 
2H' + H 2 = 3H%a 2 + e s (162) 

where e (a, n) and e s (a,n) come from the mass term. As pointed out in Section 2.6.1, 
consistency of these equations implies an equation relating nirf) and a(rj), which generically 

22 In fact, the possibility that the "cosmological" Newton's constant and "Newton's law" Newton's constant 
may be different, is of phenomenological interest [123]. 
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has the form n' = f(n, a)a', but this is irrelevant for the discussion here, as a and n can 
take arbitrary values at a given moment of time. We will be interested in nearly Minkowski 
backgrounds, for which \a — 1| <C 1, \n — 1| <C 1. Since the mass term is quadratic in metric 
perturbations about Minkowski background, the source functions e and e s vanish for a = 1, 
n — 1 and near these values one has 

e , e s = 0(a - 1) + 0(n - 1) 

As an example, in the case (63) the source functions are 

eo = -\m 2 G {a 2 - l)n , e s = -^[2(a 2 - 1) + (aV - 1)] (163) 
2 In 

We begin with the range of momenta {%) in (65). In this range, the parameters H 2 , \a — 1| 
and \n — 1| are the smallest parameters in the problem; we take them formally to be of one 
and the same order. We make use of the fact that the Minkowski values for the parameters 
entering (64) are the ones in the Fierz-Pauli theory, so that 

m\ = 3m 2 G + 0(H 2 ), m\ = - l -m 2 G + 0(H 2 ) , 

^ = 3m 2 G + 0(H 2 ) , ^2 = -m 2 G + 0(H 2 ) , /i 3 = -2m 2 G + 0(H 2 ) (164) 

while the rest of the parameters are 0(H 2 ), meaning that they vanish in the Minkowski limit. 
To find the number of dynamical modes and obtain their dispersion relations, we write down 
the system of linear equations for perturbations, and calculate its determinant to order Ti 2 . 
The equations are 

up : m 2 ^ + 2HAB + (—2Aip + 6Hip' + (J,iip) + (-2TCAE' + ii 2 AE) — (165) 
B/A : 2Wy> + m|S - 2^' = (166) 

i) : (-2Av? - 67V - §W + Pi<p) + (2AS' + AKAB) 

+ (6^" - 2Ai) + m^)' + 2qi\) + m\i)) + (-2AE" - 4HAE' + fi 3 AE) = 
Ej A : (2Hip' + 2qcp + fi 2 cp) + (-2^" - AW + ^) + m\AE = 

where q = H' + 2H 2 = 0(H 2 ). We now calculate the determinant of this system to figure out 
the number of modes and their dispersion relations. We find, after going to Fourier space 
and using (164), 

1 

A' 

+ (12m 2 G H 2 + 2m 2 G m 2 ) ■ u 4 + 0{m 2 G H 2 uj 2 V 2 ) + 0(m 2 G H 2 p 4 ) (167) 



— Det = m%(3u 2 - 3p 2 - 3m G ) 



Here we assume that p 2 3> m G , u 2 3> m G , and keep only those new (with respect to 
Minkowski space) terms which are proportional to the highest power of uj. Note that there 
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have been cancellations: in particular, the terms of order 7Y 2 p 2 c<j 4 cancelled out. These 
cancellations are remnant of the gauge invariance: the terms of order 7i 2 p 2 u; 4 would be 
independent of the graviton mass, so they would remain in de Sitter space for massless 
gravitons, which would be inconsistent with gauge invariance. 

Since the determinant is of the fourth order in u, there are two modes. One of them 
has the dispersion relation of the (longitudinal component of) massive graviton. This result 
is valid in the range of momenta (i) only; indeed, the terms neglected in (167) are large at 
high momenta. We will discuss the high momentum limit later on. In the range of momenta 
under discussion here, the second, Boulware-Deser mode has u 2 ^> p 2 , and its frequency is 
given by 

(168) 



oo 2 = 



12H 2 + 2m 2 , 



The discussion here is valid for arbitrary mass term, not necessarily (63). For any mass term 
having the Fierz-Pauli form in Minkowski background, one has m 2 ~ H 2 , so that there is 
no smooth limit to Minkowski space for the frequency of the Boulware-Deser mode. This 
mode can be both tachyonic and non-tachyonic; in the example (63) this depends on the 
sign of (a — 1). Indeed, in this example m 2 = —6H 2 — 3e , so the frequency is given by 



CO 



m G 

2e 



and its sign is inverse to the sign of (a — 1), see (163). 

To see whether the Boulware-Deser mode is a ghost, we take advantage of the fact that 
its frequency (168) does not depend on spatial momenta, provided they belong to the region 
(i). Thus, to obtain the action for this mode, we can omit terms with the Laplacian in 
eqs. (165) and (166), except for the terms containing AE (here we treat AE as a field, on 
equal footing with ip). We thus obtain 



<P 



rat 



■[2H(AE - Sip)' + m 2 G (AE - 3^)] 



B = —(2ip-2Hip) 



rrii 



3m 2 ., n 2 



—m. 



where we used leading-order expressions for Hi and /i 2 , namely Hi 
Plugging these expressions back into the action (64), integrating by parts and again omitting 
terms with the spatial Laplacian and terms suppressed by the ratio 7i 2 /m 2 Gl we find the action 
for the dynamical fields ip and AE, 



7 (2) 

EH+K+mc 



1 



2M Pl / d 3 xdria 2 



6H 2 



m , 



rn 



(AE' - 3^') Z - ( AE - 3 ^) 



3m 2 , 



2m 2 
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This expression is again valid for any mass term, not necessarily (63). Here we neglected 
the terms vri 2 ^) 2 and fi^ipAE, as their contributions to the action for the Boulware-Deser 
mode are suppressed by H 2 /m G . It is clear from this action that the Boulware-Deser field 
with the dispersion relation (168) is (AE — 3ip), and that in the case when this field is not 
a tachyon, it is a ghost. Indeed, this field is not a tachyon for (QH 2 + m 2 ) < 0, which also 
implies m 2 < 0, so that the kinetic term for this field is negative. Note also that the action 
for this field is singular in the Minkowski limit, i.e., in the limit H. — > 0, in which m 2 — > 
as well. 

Let us repeat that this analysis is valid at p 2 <C m G /7i 2 only. Going to high-momentum 
limit using the expression (167) would be incorrect, partially because the terms not explicitly 
written in (167) are important at high momenta, partially because the terms of higher order 
in Tt potentially are higher order in p 2 . 

Let us now proceed to high momentum limit (ii) in (65). The analysis leading to the 
action (66) applies to any mass term. This action has the following general form, 

S ( E ] H + A +mG = 2M Pi I d'x a 2 + C^'f + Bij'AE + ^E'AE' + ^(A£) 2 } 

To proceed further, let us consider the mass term (63). After straightforward calculation 
one finds in that case 

oo o loo oo 

= —QH — 3e , m B = — 3e — -m G a n , ^2 = — 3e — m G a n 



2 

_ 3ep — e s 2 _ _ 

V 2 ~ 2H 2 ' m E--£* 



Hence 



B 



fJ>2 — m 2 B m 2 G a 2 n 

n = 2H 



,r 2 

C = 3 + 



mtp 3e 



2H 2 2H 2 

The dispersion relations are obtained by solving the equations of motion for tp and E. There 
are two modes, one with 



and another with 



2 2 
h = P 



, , 2 €s ^2 
^2 = ' P 

3e 



These expressions are valid for all a and n, not necessarily close to 1. 
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One of these modes is a tachyon or ghost. Indeed, positivity of energy requires 

A>0 , C > , m| < , m\ > . 

Now, 

so the latter requirements would imply uj\ < 0, that is, tachyon. Note that in certain 
backgrounds, the second mode is superluminal. 
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